TY - JOUR
T1 - Accurately quantifying low-abundant targets amid similar sequences by revealing hidden correlations in oligonucleotide microarray data
AU - Marcelino, Luisa A.
AU - Backman, Vadim
AU - Donaldson, Andres
AU - Steadman, Claudia
AU - Thompson, Janelle R.
AU - Preheim, Sarah Pacocha
AU - Lien, Cynthia
AU - Lim, Eelin
AU - Veneziano, Daniele
AU - Polz, Martin F.
PY - 2006/9/12
Y1 - 2006/9/12
N2 - Microarrays have enabled the determination of how thousands of genes are expressed to coordinate function within single organisms. Yet applications to natural or engineered communities where different organisms interact to produce complex properties are hampered by theoretical and technological limitations. Here we describe a general method to accurately identify low-abundant targets in systems containing complex mixtures of homologous targets. We combined an analytical predictor of nonspecific probe-target interactions (cross-hybridization) with an optimization algorithm that iteratively deconvolutes true probe-target signal from raw signal affected by spurious contributions (cross-hybridization, noise, background, and unequal specific hybridization response). The method was capable of quantifying, with unprecedented specificity and accuracy, ribosomal RNA (rRNA) sequences in artificial and natural communities. Controlled experiments with spiked rRNA into artificial and natural communities demonstrated the accuracy of identification and quantitative behavior over different concentration ranges. Finally, we illustrated the power of this methodology for accurate detection of low-abundant targets in natural communities. We accurately identified Vibrio taxa in coastal marine samples at their natural concentrations (<0.05% of total bacteria), despite the high potential for cross-hybridization by hundreds of different coexisting rRNAs, suggesting this methodology should be expandable to any microarray platform and system requiring accurate identification of low-abundant targets amid pools of similar sequences.
AB - Microarrays have enabled the determination of how thousands of genes are expressed to coordinate function within single organisms. Yet applications to natural or engineered communities where different organisms interact to produce complex properties are hampered by theoretical and technological limitations. Here we describe a general method to accurately identify low-abundant targets in systems containing complex mixtures of homologous targets. We combined an analytical predictor of nonspecific probe-target interactions (cross-hybridization) with an optimization algorithm that iteratively deconvolutes true probe-target signal from raw signal affected by spurious contributions (cross-hybridization, noise, background, and unequal specific hybridization response). The method was capable of quantifying, with unprecedented specificity and accuracy, ribosomal RNA (rRNA) sequences in artificial and natural communities. Controlled experiments with spiked rRNA into artificial and natural communities demonstrated the accuracy of identification and quantitative behavior over different concentration ranges. Finally, we illustrated the power of this methodology for accurate detection of low-abundant targets in natural communities. We accurately identified Vibrio taxa in coastal marine samples at their natural concentrations (<0.05% of total bacteria), despite the high potential for cross-hybridization by hundreds of different coexisting rRNAs, suggesting this methodology should be expandable to any microarray platform and system requiring accurate identification of low-abundant targets amid pools of similar sequences.
KW - Cross-hybridization
KW - Free energy
KW - Microbial ecology
KW - Optimization algorithm
KW - rRNA
UR - http://www.scopus.com/inward/record.url?scp=33748765554&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33748765554&partnerID=8YFLogxK
U2 - 10.1073/pnas.0601476103
DO - 10.1073/pnas.0601476103
M3 - Article
C2 - 16950880
AN - SCOPUS:33748765554
SN - 0027-8424
VL - 103
SP - 13629
EP - 13634
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 37
ER -