MNNMDA: Predicting human microbe-disease association via a method to minimize matrix nuclear norm

计算机科学 疾病 正规化(语言学) 相似性(几何) 高斯分布 二部图 矩阵范数 核基质 人工智能 计算生物学 数据挖掘 机器学习 数学 模式识别(心理学) 医学 生物 理论计算机科学 特征向量 化学 病理 遗传学 图像(数学) 物理 图形 计算化学 染色质 DNA 量子力学
作者
Haiyan Liu,Pingping Bing,Meijun Zhang,Geng Tian,Jun Ma,Haigang Li,Meihua Bao,Kunhui He,Jianjun He,Binsheng He,Jialiang Yang
出处
期刊:Computational and structural biotechnology journal [Elsevier BV]
卷期号:21: 1414-1423 被引量:25
标识
DOI:10.1016/j.csbj.2022.12.053
摘要

Identifying the potential associations between microbes and diseases is the first step for revealing the pathological mechanisms of microbe-associated diseases. However, traditional culture-based microbial experiments are expensive and time-consuming. Thus, it is critical to prioritize disease-associated microbes by computational methods for further experimental validation. In this study, we proposed a novel method called MNNMDA, to predict microbe-disease associations (MDAs) by applying a Matrix Nuclear Norm method into known microbe and disease data. Specifically, we first calculated Gaussian interaction profile kernel similarity and functional similarity for diseases and microbes. Then we constructed a heterogeneous information network by combining the integrated disease similarity network, the integrated microbe similarity network and the known microbe-disease bipartite network. Finally, we formulated the microbe-disease association prediction problem as a low-rank matrix completion problem, which was solved by minimizing the nuclear norm of a matrix with a few regularization terms. We tested the performances of MNNMDA in three datasets including HMDAD, Disbiome, and Combined Data with small, medium and large sizes respectively. We also compared MNNMDA with 5 state-of-the-art methods including KATZHMDA, LRLSHMDA, NTSHMDA, GATMDA, and KGNMDA, respectively. MNNMDA achieved area under the ROC curves (AUROC) of 0.9536 and 0.9364 respectively on HDMAD and Disbiome, better than the AUCs of compared methods under the 5-fold cross-validation for all microbe-disease associations. It also obtained a relatively good performance with AUROC 0.8858 in the combined data. In addition, MNNMDA was also better than other methods in area under precision and recall curve (AUPR) under the 5-fold cross-validation for all associations, and in both AUROC and AUPR under the 5-fold cross-validation for diseases and the 5-fold cross-validation for microbes. Finally, the case studies on colon cancer and inflammatory bowel disease (IBD) also validated the effectiveness of MNNMDA. In conclusion, MNNMDA is an effective method in predicting microbe-disease associations. The codes and data for this paper are freely available at Github https://github.com/Haiyan-Liu666/MNNMDA.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI6.1应助马前人采纳,获得10
1秒前
1618完成签到 ,获得积分10
2秒前
李y梅子完成签到 ,获得积分10
2秒前
zbidnh完成签到,获得积分10
3秒前
月亮完成签到 ,获得积分10
4秒前
5秒前
sdfwsdfsd完成签到,获得积分10
6秒前
uuu完成签到 ,获得积分10
8秒前
Cheney完成签到,获得积分10
8秒前
孤舟完成签到,获得积分10
11秒前
马前人发布了新的文献求助10
13秒前
芹菜完成签到,获得积分10
14秒前
jn完成签到,获得积分10
14秒前
小猪完成签到,获得积分10
15秒前
尊敬的小凡完成签到,获得积分10
16秒前
aaa完成签到,获得积分10
16秒前
啦啦啦完成签到,获得积分10
16秒前
17秒前
糖豆子完成签到,获得积分10
17秒前
kk完成签到 ,获得积分10
17秒前
树袋熊完成签到,获得积分10
20秒前
davyean完成签到,获得积分10
21秒前
rice0601完成签到,获得积分10
21秒前
卓向梦完成签到,获得积分10
22秒前
韩医生口腔完成签到 ,获得积分10
22秒前
cookiezhu01完成签到 ,获得积分10
23秒前
江霭完成签到,获得积分10
24秒前
LeeSunE完成签到,获得积分10
25秒前
leiguangbin发布了新的文献求助10
26秒前
28秒前
止戈为武完成签到,获得积分0
29秒前
怡然的飞珍完成签到,获得积分10
29秒前
马前人发布了新的文献求助10
29秒前
33秒前
笑林完成签到 ,获得积分0
35秒前
huang完成签到,获得积分10
36秒前
小于子88完成签到,获得积分10
37秒前
丰富的澜完成签到 ,获得积分10
39秒前
尊敬的驳完成签到,获得积分10
39秒前
nnnnn完成签到,获得积分10
40秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Cronologia da história de Macau 5000
Merrill's Atlas of Radiographic Positioning and Procedures - 3-Volume Set, 16th Edition 2000
晚清天文学译著《谈天》版本考 720
Matrix Methods in Data Mining and Pattern Recognition 510
Calibre SVRF (Standard Verification Rule Format) Manual 2021 500
Interactions of Vowel Quality and Prosody in East Slavic 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7087082
求助须知:如何正确求助?哪些是违规求助? 8744798
关于积分的说明 18495441
捐赠科研通 6634302
什么是DOI,文献DOI怎么找? 3134459
关于科研通互助平台的介绍 2239602
邀请新用户注册赠送积分活动 2109176