TY - JOUR
T1 - RWEN
T2 - Response-weighted elastic net for prediction of chemosensitivity of cancer cell lines
AU - Basu, Amrita
AU - Mitra, Ritwik
AU - Liu, Han
AU - Schreiber, Stuart L.
AU - Clemons, Paul A.
N1 - Funding Information:
This work was supported by the National Institutes of Health (NIH) via the National Cancer Institute (NCI) Cancer Target Discovery and Development Network [grant number U01CA217848]. S.L.S. is an Investigator at the Howard Hughes Medical Institute.
Publisher Copyright:
© The Author(s) 2018.
PY - 2018/10/1
Y1 - 2018/10/1
N2 - Motivation: In recent years there have been several efforts to generate sensitivity profiles of collections of genomically characterized cell lines to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based on cellular, genetic, or expression biomarkers of cancer cells. However, a remaining challenge is an efficient way to identify accurate sets of biomarkers to validate. To address this challenge, we developed methodology using gene-expression profiles of human cancer cell lines to predict the responses of these cell lines to a panel of compounds. Results: We developed an iterative weighting scheme which, when applied to elastic net, a regularized regression method, significantly improves the overall accuracy of predictions, particularly in the highly sensitive response region. In addition to application of these methods to actual chemical sensitivity data, we investigated the effects of sample size, number of features, model sparsity, signal-to-noise ratio, andfeature correlation on predictive performance using a simulation framework, particularly for situations where the number of covariates is much larger than sample size. While our method aims to be useful in therapeutic discovery and understanding of the basic mechanisms of action of drugs and their targets, it is generally applicable in any domain where predictions of extreme responses are of highest importance.
AB - Motivation: In recent years there have been several efforts to generate sensitivity profiles of collections of genomically characterized cell lines to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based on cellular, genetic, or expression biomarkers of cancer cells. However, a remaining challenge is an efficient way to identify accurate sets of biomarkers to validate. To address this challenge, we developed methodology using gene-expression profiles of human cancer cell lines to predict the responses of these cell lines to a panel of compounds. Results: We developed an iterative weighting scheme which, when applied to elastic net, a regularized regression method, significantly improves the overall accuracy of predictions, particularly in the highly sensitive response region. In addition to application of these methods to actual chemical sensitivity data, we investigated the effects of sample size, number of features, model sparsity, signal-to-noise ratio, andfeature correlation on predictive performance using a simulation framework, particularly for situations where the number of covariates is much larger than sample size. While our method aims to be useful in therapeutic discovery and understanding of the basic mechanisms of action of drugs and their targets, it is generally applicable in any domain where predictions of extreme responses are of highest importance.
UR - http://www.scopus.com/inward/record.url?scp=85054059669&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054059669&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bty199
DO - 10.1093/bioinformatics/bty199
M3 - Article
C2 - 29688307
AN - SCOPUS:85054059669
SN - 1367-4803
VL - 34
SP - 3332
EP - 3339
JO - Bioinformatics
JF - Bioinformatics
IS - 19
ER -