A new study in the Journal of Biomedical Informatics uses machine learning on unlabeled electronic health record (EHR) data to shed light on the emergence of cardiovascular disease (CVD).
The study hinges on automated patient phenotyping (if eye color is a trait, blue eyes are a phenotype) and ample longitudinal data. Juan Zhao, Ph.D., Wei-Qi Wei, MD, Ph.D., and colleagues gathered 12,380 de-identified patient records, each reaching back at least 10 years before a CVD diagnosis. An automated scan found 1,068 distinct patient phenotypes in this dataset.
Aided by a technique called tensor decomposition, unsupervised machine learning revealed the long-term emergence of 14 distinct CVD patient subtypes. Across the six most prevalent subtypes, the risk of heart attack was markedly different, indicating that the subtypes capture meaningful clinical distinctions.
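To give a feel for what tensor decomposition does here, the sketch below arranges synthetic data as a patient × phenotype × time tensor and factors it with a plain CP (CANDECOMP/PARAFAC) decomposition fitted by alternating least squares. This is an illustration under stated assumptions, not the authors' pipeline: the article does not say which decomposition variant or software the study used, the data are invented, the rank is 2 rather than 14, and every function and variable name is hypothetical.

```python
import numpy as np

def khatri_rao(a, b):
    """Column-wise Kronecker product: (I x R), (J x R) -> (I*J x R)."""
    rank = a.shape[1]
    return (a[:, None, :] * b[None, :, :]).reshape(-1, rank)

def cp_als(tensor, rank, n_iter=200, seed=0):
    """Fit a rank-`rank` CP model to a 3-way tensor by alternating least squares."""
    rng = np.random.default_rng(seed)
    factors = [rng.random((dim, rank)) for dim in tensor.shape]
    for _ in range(n_iter):
        for mode in range(3):
            first, second = [factors[m] for m in range(3) if m != mode]
            # Unfold the tensor so rows index the mode being updated.
            unfolded = np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)
            gram = (first.T @ first) * (second.T @ second)
            factors[mode] = unfolded @ khatri_rao(first, second) @ np.linalg.pinv(gram)
    return factors

def assign_subtypes(factors):
    """Label each patient with the component on which they load most strongly."""
    patient, phenotype, time = factors
    # Fold the other factors' column scales into the patient loadings.
    scale = np.linalg.norm(phenotype, axis=0) * np.linalg.norm(time, axis=0)
    return np.argmax(np.abs(patient * scale), axis=1)

# Synthetic data (assumption): 30 patients, 6 phenotypes, 5 time steps, 2 subtypes.
rng = np.random.default_rng(42)
true_patient = np.zeros((30, 2))
true_patient[:15, 0] = rng.uniform(0.5, 1.5, size=15)   # subtype A patients
true_patient[15:, 1] = rng.uniform(0.5, 1.5, size=15)   # subtype B patients
true_phenotype = np.array([[1, 0], [1, 0], [0, 1], [0, 1], [0.2, 0], [0, 0.5]])
true_time = np.column_stack([np.linspace(0.1, 1.0, 5), np.ones(5)])
tensor = np.einsum("ir,jr,kr->ijk", true_patient, true_phenotype, true_time)

factors = cp_als(tensor, rank=2)
approx = np.einsum("ir,jr,kr->ijk", *factors)
rel_error = np.linalg.norm(tensor - approx) / np.linalg.norm(tensor)
labels = assign_subtypes(factors)
```

Each recovered component pairs a set of phenotypes with a time course and a set of patient loadings, which is the sense in which a decomposition can surface subtypes that emerge over years. On real EHR data one would more likely use a non-negative or otherwise constrained variant from an established library.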