Phenotyping through Semi-Supervised Tensor Factorization (PSST)

Jette Henderson, Huan He, Bradley A. Malin, Joshua C. Denny, Abel N. Kho, Joydeep Ghosh, Joyce C. Ho

Research output: Contribution to journal › Article › peer-review

13 Scopus citations


A computational phenotype is a set of clinically relevant and interesting characteristics that describe patients with a given condition. Various machine learning methods have been proposed to derive phenotypes in an automatic, high-throughput manner. Among these methods, computational phenotyping through tensor factorization has been shown to produce clinically interesting phenotypes. However, few of these methods incorporate auxiliary patient information into the phenotype derivation process. In this work, we introduce Phenotyping through Semi-Supervised Tensor Factorization (PSST), a method that leverages disease status knowledge about subsets of patients to generate computational phenotypes from tensors constructed from the electronic health records of patients. We demonstrate the potential of PSST to uncover predictive and clinically interesting computational phenotypes through case studies focusing on type-2 diabetes and resistant hypertension. PSST yields more discriminative phenotypes compared to the unsupervised methods and more meaningful phenotypes compared to a supervised method.
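As a rough illustration of the data structure the abstract refers to, the sketch below builds a small count tensor from toy records and reconstructs a tensor from rank-R CP (CANDECOMP/PARAFAC) factors. The mode layout (patient x diagnosis x medication), the toy records, and the function names are assumptions made for illustration only. The abstract does not specify PSST's objective or how disease status enters it, so the semi-supervised part is not reproduced here.

```python
import numpy as np

# Hypothetical toy records: (patient index, diagnosis index, medication index).
records = [(0, 0, 1), (0, 0, 1), (1, 2, 0), (2, 1, 1), (2, 1, 1), (2, 0, 0)]


def build_count_tensor(records, shape):
    """Count co-occurrences into a patient x diagnosis x medication tensor."""
    tensor = np.zeros(shape)
    for p, d, m in records:
        tensor[p, d, m] += 1
    return tensor


def cp_reconstruct(weights, A, B, C):
    """Rebuild a tensor from rank-R CP factors.

    Entry (i, j, k) is sum_r weights[r] * A[i, r] * B[j, r] * C[k, r].
    """
    return np.einsum("r,ir,jr,kr->ijk", weights, A, B, C)


X = build_count_tensor(records, shape=(3, 3, 2))

# Random nonnegative factors stand in for fitted ones (assumption: rank R = 2).
rng = np.random.default_rng(0)
R = 2
A = rng.random((3, R))  # patient mode
B = rng.random((3, R))  # diagnosis mode
C = rng.random((2, R))  # medication mode
weights = np.ones(R)

X_hat = cp_reconstruct(weights, A, B, C)
```

In this reading, each triple of columns (A[:, r], B[:, r], C[:, r]) would correspond to one candidate phenotype, and a fitting routine would choose the factors so that X_hat approximates X.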

Original language: English (US)
Pages (from-to): 564-573
Number of pages: 10
Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
State: Published - Jan 1 2018

ASJC Scopus subject areas

  • General Medicine

