TY - JOUR
T1 - Machine learning as a strategy to account for dietary synergy
T2 - An illustration based on dietary intake and adverse pregnancy outcomes
AU - Bodnar, Lisa M.
AU - Cartus, Abigail R.
AU - Kirkpatrick, Sharon I.
AU - Himes, Katherine P.
AU - Kennedy, Edward H.
AU - Simhan, Hyagriv N.
AU - Grobman, William A.
AU - Duffy, Jennifer Y.
AU - Silver, Robert M.
AU - Parry, Samuel
AU - Naimi, Ashley I.
N1 - Publisher Copyright: Copyright © The Author(s) on behalf of the American Society for Nutrition 2020.
PY - 2020/6/1
Y1 - 2020/6/1
AB - Background: Conventional analytic approaches for studying diet patterns assume no dietary synergy, which can lead to bias if incorrectly modeled. Machine learning algorithms can overcome these limitations. Objectives: We estimated associations between fruit and vegetable intake relative to total energy intake and adverse pregnancy outcomes using targeted maximum likelihood estimation (TMLE) paired with the ensemble machine learning algorithm Super Learner, and compared these with results generated from multivariable logistic regression. Methods: We used data from 7572 women in the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be. Usual daily periconceptional intake of total fruits and total vegetables was estimated from an FFQ. We calculated the marginal risk of preterm birth, small-for-gestational-age (SGA) birth, gestational diabetes, and pre-eclampsia according to density of fruits and vegetables (cups/1000 kcal) ≥80th percentile compared with <80th percentile using multivariable logistic regression and Super Learner with TMLE. Models were adjusted for confounders, including other Healthy Eating Index-2010 components. Results: Using logistic regression, high fruit and high vegetable densities were associated with 1.1% and 1.4% reductions in pre-eclampsia risk compared with lower densities, respectively. They were not associated with the 3 other outcomes. Using Super Learner with TMLE, high fruit and vegetable densities were associated with fewer cases of preterm birth (-4.0; 95% CI: -4.9, -3.0 and -3.7; 95% CI: -5.0, -2.3), SGA (-1.7; 95% CI: -2.9, -0.51 and -3.8; 95% CI: -5.0, -2.5), and pre-eclampsia (-3.2; 95% CI: -4.2, -2.2 and -4.0; 95% CI: -5.2, -2.7) per 100 births, respectively, and high vegetable densities were associated with a 0.9% increase in risk of gestational diabetes. Conclusions: The differences in results between Super Learner with TMLE and logistic regression suggest that dietary synergy, which is accounted for in machine learning, may play a role in pregnancy outcomes. This innovative methodology for analyzing dietary data has the potential to advance the study of diet patterns.
KW - birth
KW - dietary patterns
KW - machine learning
KW - pregnancy
KW - pregnant women
KW - synergy
UR - http://www.scopus.com/inward/record.url?scp=85085903819&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085903819&partnerID=8YFLogxK
U2 - 10.1093/ajcn/nqaa027
DO - 10.1093/ajcn/nqaa027
M3 - Article
C2 - 32108865
AN - SCOPUS:85085903819
SN - 0002-9165
VL - 111
SP - 1235
EP - 1243
JO - American Journal of Clinical Nutrition
JF - American Journal of Clinical Nutrition
IS - 6
ER -