TY - JOUR
T1 - Machine Learning-based Prediction Models for Diagnosis and Prognosis in Inflammatory Bowel Diseases
T2 - A Systematic Review
AU - Nguyen, Nghia H.
AU - Picetti, Dominic
AU - Dulai, Parambir S.
AU - Jairath, Vipul
AU - Sandborn, William J.
AU - Ohno-Machado, Lucila
AU - Chen, Peter L.
AU - Singh, Siddharth
N1 - Funding Information:
No direct funding was used for this project. NHN is supported by the National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health under award number T32DK007202 and T15LM01127WJS is supported in part by NIDDK-funded San Diego Digestive Diseases Research Centre [P30 DK120515]. SS is supported by the National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health under award number K23DK117058.
Publisher Copyright:
© 2021 The Author(s).
PY - 2022/3/1
Y1 - 2022/3/1
N2 - Background and Aims: There is increasing interest in machine learning-based prediction models in inflammatory bowel diseases [IBD]. We synthesised and critically appraised studies comparing machine learning vs traditional statistical models, using routinely available clinical data for risk prediction in IBD. Methods: Through a systematic review till January 1, 2021, we identified cohort studies that derived and/or validated machine learning models, based on routinely collected clinical data in patients with IBD, to predict the risk of harbouring or developing adverse clinical outcomes, and reported its predictive performance against a traditional statistical model for the same outcome. We appraised the risk of bias in these studies using the Prediction model Risk of Bias ASsessment [PROBAST] tool. Results: We included 13 studies on machine learning-based prediction models in IBD, encompassing themes of predicting treatment response to biologics and thiopurines and predicting longitudinal disease activity and complications and outcomes in patients with acute severe ulcerative colitis. The most common machine learning models used were tree-based algorithms, which are classification approaches achieved through supervised learning. Machine learning models outperformed traditional statistical models in risk prediction. However, most models were at high risk of bias, and only one was externally validated. Conclusions: Machine learning-based prediction models based on routinely collected data generally perform better than traditional statistical models in risk prediction in IBD, though frequently have high risk of bias. Future studies examining these approaches are warranted, with special focus on external validation and clinical applicability.
AB - Background and Aims: There is increasing interest in machine learning-based prediction models in inflammatory bowel diseases [IBD]. We synthesised and critically appraised studies comparing machine learning vs traditional statistical models, using routinely available clinical data for risk prediction in IBD. Methods: Through a systematic review till January 1, 2021, we identified cohort studies that derived and/or validated machine learning models, based on routinely collected clinical data in patients with IBD, to predict the risk of harbouring or developing adverse clinical outcomes, and reported its predictive performance against a traditional statistical model for the same outcome. We appraised the risk of bias in these studies using the Prediction model Risk of Bias ASsessment [PROBAST] tool. Results: We included 13 studies on machine learning-based prediction models in IBD, encompassing themes of predicting treatment response to biologics and thiopurines and predicting longitudinal disease activity and complications and outcomes in patients with acute severe ulcerative colitis. The most common machine learning models used were tree-based algorithms, which are classification approaches achieved through supervised learning. Machine learning models outperformed traditional statistical models in risk prediction. However, most models were at high risk of bias, and only one was externally validated. Conclusions: Machine learning-based prediction models based on routinely collected data generally perform better than traditional statistical models in risk prediction in IBD, though frequently have high risk of bias. Future studies examining these approaches are warranted, with special focus on external validation and clinical applicability.
KW - Crohn's disease
KW - Machine learning
KW - big data
KW - prediction
KW - ulcerative colitis
UR - http://www.scopus.com/inward/record.url?scp=85126490941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126490941&partnerID=8YFLogxK
U2 - 10.1093/ecco-jcc/jjab155
DO - 10.1093/ecco-jcc/jjab155
M3 - Article
C2 - 34492100
AN - SCOPUS:85126490941
SN - 1873-9946
VL - 16
SP - 398
EP - 413
JO - Journal of Crohn's and Colitis
JF - Journal of Crohn's and Colitis
IS - 3
ER -