TY - JOUR
T1 - The relationship between text message sentiment and self-reported depression
AU - Liu, Tony
AU - Meyerhoff, Jonah
AU - Eichstaedt, Johannes C.
AU - Karr, Chris J.
AU - Kaiser, Susan M.
AU - Kording, Konrad P.
AU - Mohr, David C.
AU - Ungar, Lyle H.
N1 - Funding Information:
This work was supported by a grant from the National Institute of Mental Health ( 5R01MH111610 ) to David C. Mohr and Konrad Kording. Jonah Meyerhoff is supported by a grant from the National Institute of Mental Health ( T32 MH115882 ). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Funding Information:
This work was supported by a grant from the National Institute of Mental Health (5R01MH111610) to David C. Mohr and Konrad Kording. Jonah Meyerhoff is supported by a grant from the National Institute of Mental Health (T32 MH115882). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Publisher Copyright:
© 2021
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Background: Personal sensing has shown promise for detecting behavioral correlates of depression, but there is little work examining personal sensing of cognitive and affective states. Digital language, particularly through personal text messages, is one source that can measure these markers. Methods: We correlated privacy-preserving sentiment analysis of text messages with self-reported depression symptom severity. We enrolled 219 U.S. adults in a 16 week longitudinal observational study. Participants installed a personal sensing app on their phones, which administered self-report PHQ-8 assessments of their depression severity, collected phone sensor data, and computed anonymized language sentiment scores from their text messages. We also trained machine learning models for predicting end-of-study self-reported depression status using on blocks of phone sensor and text features. Results: In correlation analyses, we find that degrees of depression, emotional, and personal pronoun language categories correlate most strongly with self-reported depression, validating prior literature. Our classification models which predict binary depression status achieve a leave-one-out AUC of 0.72 when only considering text features and 0.76 when combining text with other networked smartphone sensors. Limitations: Participants were recruited from a panel that over-represented women, caucasians, and individuals with self-reported depression at baseline. As language use differs across demographic factors, generalizability beyond this population may be limited. The study period also coincided with the initial COVID-19 outbreak in the United States, which may have affected smartphone sensor data quality. Conclusions: Effective depression prediction through text message sentiment, especially when combined with other personal sensors, could enable comprehensive mental health monitoring and intervention.
AB - Background: Personal sensing has shown promise for detecting behavioral correlates of depression, but there is little work examining personal sensing of cognitive and affective states. Digital language, particularly through personal text messages, is one source that can measure these markers. Methods: We correlated privacy-preserving sentiment analysis of text messages with self-reported depression symptom severity. We enrolled 219 U.S. adults in a 16 week longitudinal observational study. Participants installed a personal sensing app on their phones, which administered self-report PHQ-8 assessments of their depression severity, collected phone sensor data, and computed anonymized language sentiment scores from their text messages. We also trained machine learning models for predicting end-of-study self-reported depression status using on blocks of phone sensor and text features. Results: In correlation analyses, we find that degrees of depression, emotional, and personal pronoun language categories correlate most strongly with self-reported depression, validating prior literature. Our classification models which predict binary depression status achieve a leave-one-out AUC of 0.72 when only considering text features and 0.76 when combining text with other networked smartphone sensors. Limitations: Participants were recruited from a panel that over-represented women, caucasians, and individuals with self-reported depression at baseline. As language use differs across demographic factors, generalizability beyond this population may be limited. The study period also coincided with the initial COVID-19 outbreak in the United States, which may have affected smartphone sensor data quality. Conclusions: Effective depression prediction through text message sentiment, especially when combined with other personal sensors, could enable comprehensive mental health monitoring and intervention.
KW - Depression
KW - Digital phenotyping
KW - Language sentiment analysis
KW - Machine learning
KW - Personal sensing
UR - http://www.scopus.com/inward/record.url?scp=85123376500&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123376500&partnerID=8YFLogxK
U2 - 10.1016/j.jad.2021.12.048
DO - 10.1016/j.jad.2021.12.048
M3 - Article
C2 - 34963643
AN - SCOPUS:85123376500
SN - 0165-0327
VL - 302
SP - 7
EP - 14
JO - Journal of Affective Disorders
JF - Journal of Affective Disorders
ER -