计算机科学
疾病
人工智能
分类器(UML)
图形
机器学习
数据挖掘
模式识别(心理学)
医学
病理
理论计算机科学
作者
Siyuan Shen,Yurong Qian,Jingjing Zheng,Junyi Liu,Lei Deng
标识
DOI:10.1109/bibm52615.2021.9669467
摘要
Many studies have shown that circRNAs play essential roles in various biological processes. With the development of technology, the associations between circRNA and diseases have been discovered, and these associations will help diagnose and treat diseases. However, it is time-consuming and costly to detect the associations between circRNAs and diseases with the experimental methods. Therefore, it is necessary to develop a feasible and effective computational method for predicting circRNA-disease associations. In this paper, we propose a new computational framework called VLCDA to identify the potential circRNA-disease associations. Initially, we construct features by fusing circRNA expression profile features and circRNA protein-coding ability features, disease semantic features, circRNA and disease GIP Kernel features, and use VGAE to mine its deep latent features. Finally, we use the fusion features to train the LightGBM classifier and the trained LightGBM to identify the circRNA-disease associations. The main contribution of VLCDA is that we firstly add circRNA protein-coding ability feature to the circRNA-disease association prediction model. In addition, VLCDA uses variational graph auto-encoders to extract the latent features of circRNA-disease associations to improve the prediction model’s accuracy further. VLCDA obtained the area under the ROC curve (AUC) scores of 0.9783 in 5-fold cross-validation. In addition, in the case studies, 16 of the top 20 circRNA-disease associations predicted by VLCDA have been confirmed by relevant literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI