计算机科学
缺少数据
相似性(几何)
领域(数学分析)
数据挖掘
系列(地层学)
时间序列
人工智能
机器学习
特征(语言学)
数学
数学分析
哲学
图像(数学)
生物
古生物学
语言学
作者
Baoyao Yang,Mang Ye,Qingxiong Tan,Pong C. Yuen
出处
期刊:IEEE transactions on cybernetics
[Institute of Electrical and Electronics Engineers]
日期:2020-08-14
卷期号:52 (5): 3394-3407
被引量:14
标识
DOI:10.1109/tcyb.2020.3011934
摘要
Medical time series of laboratory tests has been collected in electronic health records (EHRs) in many countries. Machine-learning algorithms have been proposed to analyze the condition of patients using these medical records. However, medical time series may be recorded using different laboratory parameters in different datasets. This results in the failure of applying a pretrained model on a test dataset containing a time series of different laboratory parameters. This article proposes to solve this problem with an unsupervised time-series adaptation method that generates time series across laboratory parameters. Specifically, a medical time-series generation network with similarity distillation is developed to reduce the domain gap caused by the difference in laboratory parameters. The relations of different laboratory parameters are analyzed, and the similarity information is distilled to guide the generation of target-domain specific laboratory parameters. To further improve the performance in cross-domain medical applications, a missingness-aware feature extraction network is proposed, where the missingness patterns reflect the health conditions and, thus, serve as auxiliary features for medical analysis. In addition, we also introduce domain-adversarial networks in both feature level and time-series level to enhance the adaptation across domains. Experimental results show that the proposed method achieves good performance on both private and publicly available medical datasets. Ablation studies and distribution visualization are provided to further analyze the properties of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI