疾病
败血症
代表(政治)
同种类的
计算机科学
健康档案
医学
数据科学
空格(标点符号)
临床表型
重症监护医学
生物信息学
数据挖掘
表型
医疗保健
生物
病理
免疫学
法学
政治学
经济增长
生物化学
政治
热力学
基因
经济
物理
操作系统
作者
Djordje Gligorijevic,Jelena Gligorijević,Zoran Obradović
出处
期刊:Methods
[Elsevier]
日期:2016-12-01
卷期号:111: 45-55
被引量:12
标识
DOI:10.1016/j.ymeth.2016.07.021
摘要
Data-driven phenotype discoveries on Electronic Health Records (EHR) data have recently drawn benefits across many aspects of clinical practice. In the method described in this paper, we map a very large EHR database containing more than a million inpatient cases into a low dimensional space where diseases with similar phenotypes have similar representation. This embedding allows for an effective segmentation of diseases into more homogeneous categories, an important task of discovering disease types for precision medicine. In particular, many diseases have heterogeneous nature. For instance, sepsis, a systemic and progressive inflammation, can be caused by many factors, and can have multiple manifestations on different human organs. Understanding such heterogeneity of the disease can help in addressing many important issues regarding sepsis, including early diagnosis and treatment, which is of huge importance as sepsis is one of the main causes of in-hospital deaths in the United States. This study analyzes state of the art embedding models that have had huge success in various fields, applying them to disease embedding from EHR databases. Particular interest is given to learning multi-type representation of heterogeneous diseases, which leads to more homogeneous groups. Our results show evidence that such representations have phenotypes of higher quality and also provide benefit when predicting mortality of inpatient visits.
科研通智能强力驱动
Strongly Powered by AbleSci AI