TY - JOUR
T1 - Improved Statistical Methods Enable Greater Sensitivity in Rhythm Detection for Genome-Wide Data
AU - Hutchison, Alan L.
AU - Maienschein-Cline, Mark
AU - Chiang, Andrew H.
AU - Tabei, S. M.Ali
AU - Gudjonson, Herman
AU - Bahroos, Neil
AU - Allada, Ravi
AU - Dinner, Aaron R.
N1 - Publisher Copyright:
© 2015 Hutchison et al.
PY - 2015/3/20
Y1 - 2015/3/20
N2 - Robust methods for identifying patterns of expression in genome-wide data are important for generating hypotheses regarding gene function. To this end, several analytic methods have been developed for detecting periodic patterns. We improve one such method, JTK_CYCLE, by explicitly calculating the null distribution such that it accounts for multiple hypothesis testing and by including non-sinusoidal reference waveforms. We term this method empirical JTK_CYCLE with asymmetry search, and we compare its performance to JTK_CYCLE with Bonferroni and Benjamini-Hochberg multiple hypothesis testing correction, as well as to five other methods: cyclohedron test, address reduction, stable persistence, ANOVA, and F24. We find that ANOVA, F24, and JTK_CYCLE consistently outperform the other three methods when data are limited and noisy; empirical JTK_CYCLE with asymmetry search gives the greatest sensitivity while controlling for the false discovery rate. Our analysis also provides insight into experimental design and we find that, for a fixed number of samples, better sensitivity and specificity are achieved with higher numbers of replicates than with higher sampling density. Application of the methods to detecting circadian rhythms in a metadataset of microarrays that quantify time-dependent gene expression in whole heads of Drosophila melanogaster reveals annotations that are enriched among genes with highly asymmetric waveforms. These include a wide range of oxidation reduction and metabolic genes, as well as genes with transcripts that have multiple splice forms.
AB - Robust methods for identifying patterns of expression in genome-wide data are important for generating hypotheses regarding gene function. To this end, several analytic methods have been developed for detecting periodic patterns. We improve one such method, JTK_CYCLE, by explicitly calculating the null distribution such that it accounts for multiple hypothesis testing and by including non-sinusoidal reference waveforms. We term this method empirical JTK_CYCLE with asymmetry search, and we compare its performance to JTK_CYCLE with Bonferroni and Benjamini-Hochberg multiple hypothesis testing correction, as well as to five other methods: cyclohedron test, address reduction, stable persistence, ANOVA, and F24. We find that ANOVA, F24, and JTK_CYCLE consistently outperform the other three methods when data are limited and noisy; empirical JTK_CYCLE with asymmetry search gives the greatest sensitivity while controlling for the false discovery rate. Our analysis also provides insight into experimental design and we find that, for a fixed number of samples, better sensitivity and specificity are achieved with higher numbers of replicates than with higher sampling density. Application of the methods to detecting circadian rhythms in a metadataset of microarrays that quantify time-dependent gene expression in whole heads of Drosophila melanogaster reveals annotations that are enriched among genes with highly asymmetric waveforms. These include a wide range of oxidation reduction and metabolic genes, as well as genes with transcripts that have multiple splice forms.
UR - http://www.scopus.com/inward/record.url?scp=84926337098&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84926337098&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1004094
DO - 10.1371/journal.pcbi.1004094
M3 - Article
C2 - 25793520
AN - SCOPUS:84926337098
SN - 1553-734X
VL - 11
JO - PLoS computational biology
JF - PLoS computational biology
IS - 3
M1 - e1004094
ER -