TY - JOUR
T1 - Gene-environment interactions in genome-wide association studies
T2 - A comparative study of tests applied to empirical studies of type 2 diabetes
AU - Cornelis, Marilyn C.
AU - Tchetgen, Eric J Tchetgen
AU - Liang, Liming
AU - Qi, Lu
AU - Chatterjee, Nilanjan
AU - Hu, Frank B.
AU - Kraft, Peter
PY - 2012/2/1
Y1 - 2012/2/1
N2 - The question of which statistical approach is the most effective for investigating gene-environment (G-E) interactions in the context of genome-wide association studies (GWAS) remains unresolved. By using 2 case-control GWAS (the Nurses' Health Study, 1976-2006, and the Health Professionals Follow-up Study, 1986-2006) of type 2 diabetes, the authors compared 5 tests for interactions: standard logistic regression-based case-control; case-only; semiparametric maximum-likelihood estimation of an empirical-Bayes shrinkage estimator; and 2-stage tests. The authors also compared 2 joint tests of genetic main effects and G-E interaction. Elevated body mass index was the exposure of interest and was modeled as a binary trait to avoid an inflated type I error rate that the authors observed when the main effect of continuous body mass index was misspecified. Although both the case-only and the semiparametric maximum-likelihood estimation approaches assume that the tested markers are independent of exposure in the general population, the authors did not observe any evidence of inflated type I error for these tests in their studies with 2,199 cases and 3,044 controls. Both joint tests detected markers with known marginal effects. Loci with the most significant G-E interactions using the standard, empirical-Bayes, and 2-stage tests were strongly correlated with the exposure among controls. Study findings suggest that methods exploiting G-E independence can be efficient and valid options for investigating G-E interactions in GWAS.
AB - The question of which statistical approach is the most effective for investigating gene-environment (G-E) interactions in the context of genome-wide association studies (GWAS) remains unresolved. By using 2 case-control GWAS (the Nurses' Health Study, 1976-2006, and the Health Professionals Follow-up Study, 1986-2006) of type 2 diabetes, the authors compared 5 tests for interactions: standard logistic regression-based case-control; case-only; semiparametric maximum-likelihood estimation of an empirical-Bayes shrinkage estimator; and 2-stage tests. The authors also compared 2 joint tests of genetic main effects and G-E interaction. Elevated body mass index was the exposure of interest and was modeled as a binary trait to avoid an inflated type I error rate that the authors observed when the main effect of continuous body mass index was misspecified. Although both the case-only and the semiparametric maximum-likelihood estimation approaches assume that the tested markers are independent of exposure in the general population, the authors did not observe any evidence of inflated type I error for these tests in their studies with 2,199 cases and 3,044 controls. Both joint tests detected markers with known marginal effects. Loci with the most significant G-E interactions using the standard, empirical-Bayes, and 2-stage tests were strongly correlated with the exposure among controls. Study findings suggest that methods exploiting G-E independence can be efficient and valid options for investigating G-E interactions in GWAS.
KW - case study
KW - case-control studies
KW - diabetes mellitus, type 2
KW - epidemiologic methods
KW - genome-wide association study
KW - genotype-environment interaction
UR - http://www.scopus.com/inward/record.url?scp=84856155219&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856155219&partnerID=8YFLogxK
U2 - 10.1093/aje/kwr368
DO - 10.1093/aje/kwr368
M3 - Article
C2 - 22199026
AN - SCOPUS:84856155219
SN - 0002-9262
VL - 175
SP - 191
EP - 202
JO - American Journal of Epidemiology
JF - American Journal of Epidemiology
IS - 3
ER -