Causal knowledge graph construction and evaluation for clinical decision support of diabetic nephropathy

计算机科学 临床决策支持系统 图形 数据挖掘 知识抽取 机器学习 人工智能 决策支持系统 情报检索 自然语言处理 理论计算机科学
作者
Kewei Lyu,Yu Tian,Yong Shang,Tianshu Zhou,Ziyue Yang,Qianghua Liu,Xi Yao,Ping Zhang,Jianghua Chen,Jingsong Li
出处
期刊:Journal of Biomedical Informatics [Elsevier BV]
卷期号:139: 104298-104298 被引量:24
标识
DOI:10.1016/j.jbi.2023.104298
摘要

Many important clinical decisions require causal knowledge (CK) to take action. Although many causal knowledge bases for medicine have been constructed, a comprehensive evaluation based on real-world data and methods for handling potential knowledge noise are still lacking. The objectives of our study are threefold: (1) propose a framework for the construction of a large-scale and high-quality causal knowledge graph (CKG); (2) design the methods for knowledge noise reduction to improve the quality of the CKG; (3) evaluate the knowledge completeness and accuracy of the CKG using real-world data. We extracted causal triples from three knowledge sources (SemMedDB, UpToDate and Churchill's Pocketbook of Differential Diagnosis) based on rule methods and language models, performed ontological encoding, and then designed semantic modeling between electronic health record (EHR) data and the CKG to complete knowledge instantiation. We proposed two graph pruning strategies (co-occurrence ratio and causality ratio) to reduce the potential noise introduced by SemMedDB. Finally, the evaluation was carried out by taking the diagnostic decision support (DDS) of diabetic nephropathy (DN) as a real-world case. The data originated from a Chinese hospital EHR system from October 2010 to October 2020. The knowledge completeness and accuracy of the CKG were evaluated based on three state-of-the-art embedding methods (R-GCN, MHGRN and MedPath), the annotated clinical text and the expert review, respectively. This graph included 153,289 concepts and 1,719,968 causal triples. A total of 1427 inpatient data were used for evaluation. Better results were achieved by combining three knowledge sources than using only SemMedDB (three models: area under the receiver operating characteristic curve (AUC): p < 0.01, F1: p < 0.01), and the graph covered 93.9 % of the causal relations between diseases and diagnostic evidence recorded in clinical text. Causal relations played a vital role in all relations related to disease progression for DDS of DN (three models: AUC: p > 0.05, F1: p > 0.05), and after pruning, the knowledge accuracy of the CKG was significantly improved (three models: AUC: p < 0.01, F1: p < 0.01; expert review: average accuracy: + 5.5 %). The results demonstrated that our proposed CKG could completely and accurately capture the abstract CK under the concrete EHR data, and the pruning strategies could improve the knowledge accuracy of our CKG. The CKG has the potential to be applied to the DDS of diseases.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
迅速的幻雪完成签到 ,获得积分10
1秒前
lmm6701完成签到,获得积分10
3秒前
PPSlu完成签到,获得积分10
10秒前
TUTU完成签到 ,获得积分10
11秒前
研友_VZGVzn完成签到,获得积分10
15秒前
腻腻发布了新的文献求助10
17秒前
祁灵枫完成签到,获得积分10
21秒前
申燕婷完成签到 ,获得积分10
22秒前
蔡从安发布了新的文献求助10
24秒前
Asumita完成签到,获得积分10
25秒前
小山己几发布了新的文献求助10
26秒前
盟主完成签到 ,获得积分10
33秒前
wQ1ng应助蔡从安采纳,获得10
34秒前
MENG完成签到,获得积分10
35秒前
Sleven完成签到,获得积分10
36秒前
大力道罡完成签到,获得积分10
37秒前
hhh2018687完成签到,获得积分10
40秒前
木雨亦潇潇完成签到,获得积分10
40秒前
oleskarabach发布了新的文献求助10
43秒前
独特的忆彤完成签到 ,获得积分10
46秒前
笑林完成签到 ,获得积分10
52秒前
彭于晏应助山水之乐采纳,获得10
54秒前
从容的水壶完成签到 ,获得积分10
54秒前
赟yun完成签到,获得积分0
55秒前
Pure完成签到 ,获得积分10
55秒前
吉祥高趙完成签到 ,获得积分10
55秒前
59秒前
laber完成签到,获得积分0
1分钟前
华仔应助怕黑的金鱼采纳,获得10
1分钟前
Alanni完成签到 ,获得积分10
1分钟前
丸子完成签到 ,获得积分10
1分钟前
34882738完成签到 ,获得积分10
1分钟前
sora完成签到,获得积分10
1分钟前
1分钟前
山水之乐发布了新的文献求助10
1分钟前
jscr完成签到,获得积分10
1分钟前
明理从露完成签到 ,获得积分10
1分钟前
思源应助科研通管家采纳,获得10
1分钟前
laber应助科研通管家采纳,获得50
1分钟前
1分钟前
高分求助中
Pipeline and riser loss of containment 2001 - 2020 (PARLOC 2020) 1000
哈工大泛函分析教案课件、“72小时速成泛函分析:从入门到入土.PDF”等 660
Comparing natural with chemical additive production 500
The Leucovorin Guide for Parents: Understanding Autism’s Folate 500
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 500
A Manual for the Identification of Plant Seeds and Fruits : Second revised edition 500
The Social Work Ethics Casebook: Cases and Commentary (revised 2nd ed.) 400
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 内科学 生物化学 物理 计算机科学 纳米技术 遗传学 基因 复合材料 化学工程 物理化学 病理 催化作用 免疫学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 5212175
求助须知:如何正确求助?哪些是违规求助? 4388435
关于积分的说明 13663849
捐赠科研通 4248864
什么是DOI,文献DOI怎么找? 2331208
邀请新用户注册赠送积分活动 1328931
关于科研通互助平台的介绍 1282248