TY - JOUR
T1 - Development of a natural language processing algorithm to identify and evaluate transgender patients in electronic health record systems
AU - Ehrenfeld, Jesse M.
AU - Gottlieb, Keanan Gabriel
AU - Beach, Lauren Brittany
AU - Monahan, Shelby E.
AU - Fabbri, Daniel
N1 - Publisher Copyright:
© 2019 Ethnicity and Disease, Inc.. All rights reserved.
PY - 2019
Y1 - 2019
N2 - Objective: To create a natural language processing (NLP) algorithm to identify transgender patients in electronic health records. Design: We developed an NLP algorithm to identify patients (keyword + billing codes). Patients were manually reviewed, and their health care services categorized by billing code. Setting: Vanderbilt University Medical Center Participants: 234 adult and pediatric transgender patients Main Outcome Measures: Number of transgender patients correctly identified and categorization of health services utilized. Results: We identified 234 transgender patients of whom 50% had a diagnosed mental health condition, 14% were living with HIV, and 7% had diabetes. Largely driven by hormone use, nearly half of patients attended the Endocrinology/Diabetes/Metabolism clinic. Many patients also attended the Psychiatry, HIV, and/or Obstetrics/Gynecology clinics. The false positive rate of our algorithm was 3%. Conclusions: Our novel algorithm correctly identified transgender patients and provided important insights into health care utilization among this marginalized population.
AB - Objective: To create a natural language processing (NLP) algorithm to identify transgender patients in electronic health records. Design: We developed an NLP algorithm to identify patients (keyword + billing codes). Patients were manually reviewed, and their health care services categorized by billing code. Setting: Vanderbilt University Medical Center Participants: 234 adult and pediatric transgender patients Main Outcome Measures: Number of transgender patients correctly identified and categorization of health services utilized. Results: We identified 234 transgender patients of whom 50% had a diagnosed mental health condition, 14% were living with HIV, and 7% had diabetes. Largely driven by hormone use, nearly half of patients attended the Endocrinology/Diabetes/Metabolism clinic. Many patients also attended the Psychiatry, HIV, and/or Obstetrics/Gynecology clinics. The false positive rate of our algorithm was 3%. Conclusions: Our novel algorithm correctly identified transgender patients and provided important insights into health care utilization among this marginalized population.
KW - Electronic health records
KW - Natural language processing
KW - Transgender
KW - Utilization
UR - http://www.scopus.com/inward/record.url?scp=85069941515&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85069941515&partnerID=8YFLogxK
U2 - 10.18865/ED.29.S2.441
DO - 10.18865/ED.29.S2.441
M3 - Article
C2 - 31308617
AN - SCOPUS:85069941515
SN - 1049-510X
VL - 29
SP - 441
EP - 450
JO - Ethnicity and Disease
JF - Ethnicity and Disease
ER -