相似性(几何)
疾病
联想(心理学)
秩(图论)
矩阵完成
人工智能
水准点(测量)
计算机科学
交叉验证
雅卡索引
计算生物学
机器学习
数据挖掘
高斯分布
模式识别(心理学)
数学
生物
医学
地理
病理
心理学
物理
图像(数学)
组合数学
心理治疗师
量子力学
大地测量学
作者
Cheng Yan,Guihua Duan,Fang‐Xiang Wu,Yi Pan,Jianxin Wang
标识
DOI:10.1109/tcbb.2019.2926716
摘要
With the development of high-through sequencing technology and microbiology, many studies have evidenced that microbes are associated with human diseases, such as obesity, liver cancer, and so on. Therefore, identifying the association between microbes and diseases has become an important study topic in current bioinformatics. The emergence of microbe-disease association database has provided an unprecedented opportunity to develop computational method for predicting microbe-disease associations. In the study, we propose a low-rank matrix completion method (called MCHMDA) to predict microbe-disease associations by integrating similarities of microbes and diseases and known microbe-disease associations into a heterogeneous network. The microbe similarity is computed from Gaussian Interaction Profile (GIP) kernel similarity based on the known microbe-disease associations. Then, we further improve the microbe similarity by taking into account the inhabiting organs of these microbes in human body. The disease similarity is computed by the average of disease GIP similarity, disease symptom-based similarity, and disease functional similarity. Then, we construct a heterogeneous microbe-disease association network by integrating the microbe similarity network, disease similarity network, and known microbe-disease association network. Finally, a matrix completion method is used to calculate the association scores of unknown microbe-disease pairs by the fast Singular Value Thresholding (SVT) algorithm. Via 5-fold Cross Validation (5CV) and Leave-One-Out Cross Validation (LOOCV), we evaluate the prediction performances of MCHMDA and other state-of-the-art methods which include BRWMDA, NGRHMDA, LRLSHMDA, and KATZHMDA. On benchmark dataset HMDAD, the experimental results show that MCHMDA outperforms other methods in terms of area under the receiver operating characteristic curve (AUC). MCHMDA achieves the AUC values of 0.9251 and 0.9495 in 5CV and LOOCV, respectively, which are the highest values among the competing methods. In addition, we also further indicate the prediction generality of MCHMDA on an expanded microbe-disease associations dataset (HMDAD-SUP). Finally, case studies prove the prediction ability in practical applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI