Accurate and Fast Multiple-Testing Correction in eQTL Studies

Jae Hoon Sul, Towfique Raj, Simone de Jong, Paul I.W. de Bakker, Soumya Raychaudhuri, Roel A. Ophoff, Barbara Elaine Stranger, Eleazar Eskin*, Buhm Han

*Corresponding author for this work

Research output: Contribution to journal; Article; peer-review

17 Scopus citations

Abstract

In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. For performing multiple-testing correction, a permutation test is widely used. Because of growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck. In this paper, we propose an efficient approach for correcting for multiple testing and assessing eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We show that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset.
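
The general idea of replacing permutations with draws from a multivariate normal distribution can be illustrated with a minimal sketch. It is not the authors' implementation and omits their small-sample correction. It assumes two-sided per-variant tests and a known correlation (linkage-disequilibrium) matrix among the cis variants; the names `cholesky`, `mvn_gene_level_pvalue`, `ld_corr`, and `n_draws` are hypothetical. Null z-scores are drawn from N(0, ld_corr), and the gene-level p value is the fraction of draws whose largest |z| is at least as extreme as the observed minimum p value implies. The cost depends on the number of variants and draws, not on the number of individuals.

```python
import math
import random
from statistics import NormalDist


def cholesky(matrix):
    """Return the lower-triangular Cholesky factor of a positive-definite matrix."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                lower[i][j] = math.sqrt(matrix[i][i] - s)
            else:
                lower[i][j] = (matrix[i][j] - s) / lower[j][j]
    return lower


def mvn_gene_level_pvalue(min_p, ld_corr, n_draws=20000, seed=0):
    """Estimate a gene-level p value for the smallest per-variant p value.

    Hypothetical helper (an assumption, not the paper's code): samples null
    z-scores from N(0, ld_corr) and counts how often the largest |z| is at
    least as extreme as the observed one.
    """
    rng = random.Random(seed)
    lower = cholesky(ld_corr)
    n = len(ld_corr)
    # Convert the two-sided minimum p value into a |z| threshold.
    threshold = NormalDist().inv_cdf(1.0 - min_p / 2.0)
    hits = 0
    for _ in range(n_draws):
        indep = [rng.gauss(0.0, 1.0) for _ in range(n)]
        # Impose the LD correlation structure: z = L * indep.
        z = [sum(lower[i][k] * indep[k] for k in range(i + 1)) for i in range(n)]
        if max(abs(v) for v in z) >= threshold:
            hits += 1
    # Add-one estimate keeps the p value strictly positive.
    return (hits + 1) / (n_draws + 1)


# Five independent variants versus five variants in strong LD (r = 0.95).
independent = [[1.0 if i == j else 0.0 for j in range(5)] for i in range(5)]
strong_ld = [[1.0 if i == j else 0.95 for j in range(5)] for i in range(5)]
p_independent = mvn_gene_level_pvalue(0.01, independent)
p_strong_ld = mvn_gene_level_pvalue(0.01, strong_ld)
```

With the same minimum p value, the correlated variants behave like fewer independent tests, so the gene-level p value for `strong_ld` comes out smaller than the one for `independent`.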

Original language: English (US)
Pages (from-to): 857-868
Number of pages: 12
Journal: American Journal of Human Genetics
Volume: 96
Issue number: 6
State: Published - May 1, 2015

ASJC Scopus subject areas

  • Genetics (clinical)
  • Genetics
