Abstract
The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as compositional. In compositional data, each element is positive, and all the elements sum to a constant, which can be set to one in general. Standard statistical methods are not directly applicable for the analysis of compositional data because they do not appropriately handle correlations between the compositional elements. In this paper, we review statistical methods for compositional data analysis and illustrate them in the context of immunology. Specifically, we focus on regression analyses using log-ratio transformations and the alternative approach using Dirichlet regression analysis, discuss their theoretical foundations, and illustrate their applications with immune cellular fraction data generated from colorectal cancer patients.
Original language | English (US) |
---|---|
Pages (from-to) | 453-469 |
Number of pages | 17 |
Journal | Communications for Statistical Applications and Methods |
Volume | 29 |
Issue number | 4 |
DOIs | |
State | Published - 2022 |
Funding
This work was supported by the National Institutes of Health (grant numbers R01-GM122078, R21-CA209848, U01-DA045300) awarded to Dongjun Chung,, and the Human Resources Program in Energy Technology of the Korea Institute of Energy Technology Evaluation and Planning(KETEP) granted financial resource from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20204010600060) awarded to Young Min Kim. The funders had no role in the study design, data collection, and analysis, decision to publish, or preparation of the manuscript. 1 Corresponding author: Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA. E-mail: [email protected] 2Corresponding author: Department of Statistics, Kyungpook National University, Daegu, 41566, South Korea. E-mail: [email protected]
Keywords
- Compositional data
- Compositional regression
- Dirichlet regression
- Immuno-oncology
- Immunology
- Log-ratio transformation
ASJC Scopus subject areas
- Statistics and Probability
- Modeling and Simulation
- Finance
- Statistics, Probability and Uncertainty
- Applied Mathematics