TY - JOUR
T1 - Using electronic health record data to link families
T2 - an illustrative example using intergenerational patterns of obesity
AU - Krefman, Amy E.
AU - Ghamsari, Farhad
AU - Turner, Daniel R.
AU - Lu, Alice
AU - Borsje, Martin
AU - Wood, Colby Witherup
AU - Petito, Lucia C.
AU - Polubriaginof, Fernanda C.G.
AU - Schneider, Daniel
AU - Ahmad, Faraz
AU - Allen, Norrina B.
N1 - Publisher Copyright:
© The Author(s) 2023.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Objective: Electronic health record (EHR) data are a valuable resource for population health research but lack critical information such as relationships between individuals. Emergency contacts in EHRs can be used to link family members, creating a population that is more representative of a community than traditional family cohorts. Materials and Methods: We revised a published algorithm: relationship inference from the electronic health record (RIFTEHR). Our version, Pythonic RIFTEHR (P-RIFTEHR), identifies a patient's emergency contacts, matches them to existing patients (when available) using network graphs, checks for conflicts, and infers new relationships. P-RIFTEHR was run on December 15, 2021 in the Northwestern Medicine Electronic Data Warehouse (NMEDW) on approximately 2.95 million individuals and was validated using the existing link between children born at NM hospitals and their mothers. As proof-of-concept, we modeled the association between parent and child obesity using logistic regression. Results: The P-RIFTEHR algorithm matched 1 157 454 individuals in 448 278 families. The median family size was 2, the largest was 32 persons, and 247 families spanned 4 generations or more. Validation of the mother-child pairs resulted in 95.1% sensitivity. Children were 2 times more likely to be obese if a parent is obese (OR: 2.30; 95% CI, 2.23-2.37). Conclusion: P-RIFTEHR can identify familiar relationships in a large, diverse population in an integrated health system. Estimates of parent-child inheritability of obesity using family structures identified by the algorithm were consistent with previously published estimates from traditional cohort studies.
AB - Objective: Electronic health record (EHR) data are a valuable resource for population health research but lack critical information such as relationships between individuals. Emergency contacts in EHRs can be used to link family members, creating a population that is more representative of a community than traditional family cohorts. Materials and Methods: We revised a published algorithm: relationship inference from the electronic health record (RIFTEHR). Our version, Pythonic RIFTEHR (P-RIFTEHR), identifies a patient's emergency contacts, matches them to existing patients (when available) using network graphs, checks for conflicts, and infers new relationships. P-RIFTEHR was run on December 15, 2021 in the Northwestern Medicine Electronic Data Warehouse (NMEDW) on approximately 2.95 million individuals and was validated using the existing link between children born at NM hospitals and their mothers. As proof-of-concept, we modeled the association between parent and child obesity using logistic regression. Results: The P-RIFTEHR algorithm matched 1 157 454 individuals in 448 278 families. The median family size was 2, the largest was 32 persons, and 247 families spanned 4 generations or more. Validation of the mother-child pairs resulted in 95.1% sensitivity. Children were 2 times more likely to be obese if a parent is obese (OR: 2.30; 95% CI, 2.23-2.37). Conclusion: P-RIFTEHR can identify familiar relationships in a large, diverse population in an integrated health system. Estimates of parent-child inheritability of obesity using family structures identified by the algorithm were consistent with previously published estimates from traditional cohort studies.
KW - cohort studies
KW - electronic health record
KW - family characteristics
KW - obesity
KW - population health
UR - http://www.scopus.com/inward/record.url?scp=85152977187&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85152977187&partnerID=8YFLogxK
U2 - 10.1093/jamia/ocad028
DO - 10.1093/jamia/ocad028
M3 - Article
C2 - 36857086
AN - SCOPUS:85152977187
SN - 1067-5027
VL - 30
SP - 915
EP - 922
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 5
ER -