Automatic phenotyping of electronical health record: PheVis algorithm

Abstract

Electronic Health Records (EHRs) often lack reliable annotation of patient medical conditions. Yu et al. recently proposed PheNorm, an automated unsupervised algorithm to identify patient medical conditions from EHR data. PheVis extends PheNorm to the resolution of individual visits. It combines diagnosis codes with medical concepts extracted from medical notes and incorporates past history in a machine learning approach, providing an interpretable white-box predictor of the probability that a given medical condition occurs at each visit. PheVis is applied to two real-world use cases using the data warehouse of the University Hospital of Bordeaux: i) rheumatoid arthritis, a chronic condition; ii) tuberculosis, an acute condition. Cross-validated AUROCs were 0.948 [0.945; 0.950] and 0.987 [0.983; 0.990], respectively. PheVis performs well for chronic conditions, but for acute conditions its performance in French is limited because the natural language processing tools do not exclude mentions of past medical history.

Publication
medRxiv 2020.06.15.20131458
Date