EHR-HGCN: An Enhanced Hybrid Approach for Text Classification Using Heterogeneous Graph Convolutional Networks in Electronic Health Records

计算机科学 判决 人工智能 自然语言处理 图形 卷积神经网络 图形数据库 生物医学文本挖掘 文本图 文本挖掘 情报检索 理论计算机科学
作者
Guishen Wang,Xiaoxue Lou,Fang Guo,Devin Kwok,Chen Cao
出处
期刊:IEEE Journal of Biomedical and Health Informatics [Institute of Electrical and Electronics Engineers]
卷期号:28 (3): 1668-1679 被引量:8
标识
DOI:10.1109/jbhi.2023.3346210
摘要

Text classification is a central part of natural language processing, with important applications in understanding the knowledge behind biomedical texts including electronic health records (EHR). In this article, we propose a novel heterogeneous graph convolutional network method for classifying EHR texts. Our method, called EHR-HGCN, is able to combine context-sensitive word and sentence embeddings with structural sentence-level and word-level relation information to perform text classification. EHR-HGCN reframes EHR text classification as a graph classification task to better capture structural information about the document using a heterogeneous graph. To mine contextual information from a document, EHR-HGCN first applies a bidirectional recurrent neural network (BiRNN) on word embeddings obtained via Global Vectors for word representation (GloVe) to obtain context-sensitive word-level and sentence-level embeddings. To mine structural relationships from the document, EHR-HGCN then constructs a heterogeneous graph over the word and sentence embeddings, where sentence-word and word-word relationships are represented by graph edges. Finally, a heterogeneous graph convolutional neural network is used to classify documents by their graph representation. We evaluate EHR-HGCN on a variety of standard text classification benchmarks and find that EHR-HGCN has higher accuracy and F1-score than other representative machine learning and deep learning methods. We also apply EHR-HGCN to the MedLit benchmark and find it performs with high accuracy and F1-score on the task of section classification in EHR texts. Our ablation experiments show that the heterogeneous graph construction and heterogeneous graph convolutional network are critical to the performance of EHR-HGCN.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
qq完成签到,获得积分10
刚刚
nostalgic完成签到,获得积分10
1秒前
Twonej举报oy求助涉嫌违规
1秒前
Lipuer发布了新的文献求助10
1秒前
年轻乐巧完成签到,获得积分10
2秒前
pp完成签到,获得积分10
2秒前
2秒前
TIAN发布了新的文献求助10
2秒前
2秒前
Jessie完成签到,获得积分10
2秒前
庞大的蝰蛇完成签到,获得积分10
2秒前
3秒前
大拿发布了新的文献求助200
3秒前
3秒前
3秒前
成就的毛衣完成签到,获得积分10
4秒前
科学修仙发布了新的文献求助10
4秒前
宜醉宜游宜睡应助dbq采纳,获得10
4秒前
4秒前
YiWei发布了新的文献求助10
5秒前
老花眼莫莫完成签到,获得积分10
5秒前
打打应助lllppp采纳,获得10
5秒前
6秒前
007关闭了007文献求助
6秒前
7秒前
haiwei完成签到,获得积分10
7秒前
8秒前
lingzhi完成签到 ,获得积分10
8秒前
枫叶人生发布了新的文献求助10
8秒前
xzn1123发布了新的文献求助30
8秒前
8秒前
8秒前
8秒前
LJW发布了新的文献求助10
9秒前
9秒前
爆米花应助眯眯眼的朋友采纳,获得10
10秒前
kindong完成签到,获得积分10
10秒前
10秒前
万能图书馆应助木桶人plus采纳,获得10
10秒前
思源应助小肆采纳,获得10
11秒前
高分求助中
Clinical Epidemiology: The Essentials, 6e 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6540487
求助须知:如何正确求助?哪些是违规求助? 8331686
关于积分的说明 17854231
捐赠科研通 5646189
什么是DOI,文献DOI怎么找? 2936335
邀请新用户注册赠送积分活动 1912418
关于科研通互助平台的介绍 1773290