TY - JOUR
T1 - SECIMTools
T2 - A suite of metabolomics data analysis tools
AU - Kirpich, Alexander S.
AU - Ibarra, Miguel
AU - Moskalenko, Oleksandr
AU - Fear, Justin M.
AU - Gerken, Joseph
AU - Mi, Xinlei
AU - Ashrafi, Ali
AU - Morse, Alison M.
AU - McIntyre, Lauren M.
N1 - Publisher Copyright:
© 2018 The Author(s).
PY - 2018/4/20
Y1 - 2018/4/20
N2 - Background: Metabolomics has the promise to transform the area of personalized medicine with the rapid development of high throughput technology for untargeted analysis of metabolites. Open access, easy to use, analytic tools that are broadly accessible to the biological community need to be developed. While technology used in metabolomics varies, most metabolomics studies have a set of features identified. Galaxy is an open access platform that enables scientists at all levels to interact with big data. Galaxy promotes reproducibility by saving histories and enabling the sharing workflows among scientists. Results: SECIMTools (SouthEast Center for Integrated Metabolomics) is a set of Python applications that are available both as standalone tools and wrapped for use in Galaxy. The suite includes a comprehensive set of quality control metrics (retention time window evaluation and various peak evaluation tools), visualization techniques (hierarchical cluster heatmap, principal component analysis, modular modularity clustering), basic statistical analysis methods (partial least squares - discriminant analysis, analysis of variance, t-test, Kruskal-Wallis non-parametric test), advanced classification methods (random forest, support vector machines), and advanced variable selection tools (least absolute shrinkage and selection operator LASSO and Elastic Net). Conclusions: SECIMTools leverages the Galaxy platform and enables integrated workflows for metabolomics data analysis made from building blocks designed for easy use and interpretability. Standard data formats and a set of utilities allow arbitrary linkages between tools to encourage novel workflow designs. The Galaxy framework enables future data integration for metabolomics studies with other omics data.
AB - Background: Metabolomics has the promise to transform the area of personalized medicine with the rapid development of high throughput technology for untargeted analysis of metabolites. Open access, easy to use, analytic tools that are broadly accessible to the biological community need to be developed. While technology used in metabolomics varies, most metabolomics studies have a set of features identified. Galaxy is an open access platform that enables scientists at all levels to interact with big data. Galaxy promotes reproducibility by saving histories and enabling the sharing workflows among scientists. Results: SECIMTools (SouthEast Center for Integrated Metabolomics) is a set of Python applications that are available both as standalone tools and wrapped for use in Galaxy. The suite includes a comprehensive set of quality control metrics (retention time window evaluation and various peak evaluation tools), visualization techniques (hierarchical cluster heatmap, principal component analysis, modular modularity clustering), basic statistical analysis methods (partial least squares - discriminant analysis, analysis of variance, t-test, Kruskal-Wallis non-parametric test), advanced classification methods (random forest, support vector machines), and advanced variable selection tools (least absolute shrinkage and selection operator LASSO and Elastic Net). Conclusions: SECIMTools leverages the Galaxy platform and enables integrated workflows for metabolomics data analysis made from building blocks designed for easy use and interpretability. Standard data formats and a set of utilities allow arbitrary linkages between tools to encourage novel workflow designs. The Galaxy framework enables future data integration for metabolomics studies with other omics data.
UR - http://www.scopus.com/inward/record.url?scp=85045643189&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85045643189&partnerID=8YFLogxK
U2 - 10.1186/s12859-018-2134-1
DO - 10.1186/s12859-018-2134-1
M3 - Article
C2 - 29678131
AN - SCOPUS:85045643189
SN - 1471-2105
VL - 19
JO - BMC bioinformatics
JF - BMC bioinformatics
IS - 1
M1 - 151
ER -