计算机科学
嵌入
自编码
模式识别(心理学)
数据挖掘
人工智能
源代码
机器学习
算法
深度学习
操作系统
作者
Xinfei Wang,Chang-Qing Yu,Zhu‐Hong You,Yan Qiao,Zhengwei Li,Wenzhun Huang
标识
DOI:10.1016/j.compbiomed.2023.107421
摘要
Accumulating clinical evidence shows that circular RNA (circRNA) plays an important regulatory role in the occurrence and development of human diseases, which is expected to provide a new perspective for the diagnosis and treatment of related diseases. Using computational methods can provide high probability preselection for wet experiments to save resources. However, due to the lack of neighborhood structure in sparse biological networks, the model based on network embedding and graph embedding is difficult to achieve ideal results.In this paper, we propose BioDGW-CMI, which combines biological text mining and wavelet diffusion-based sparse network structure embedding to predict circRNA-miRNA interaction (CMI). In detail, BioDGW-CMI first uses the Bidirectional Encoder Representations from Transformers (BERT) for biological text mining to mine hidden features in RNA sequences, then constructs a CMI network, obtains the topological structure embedding of nodes in the network through heat wavelet diffusion patterns. Next, the Denoising autoencoder organically combines the structural features and Gaussian kernel similarity, finally, the feature is sent to lightGBM for training and prediction. BioDGW-CMI achieves the highest prediction performance in all three datasets in the field of CMI prediction. In the case study, all the 8 pairs of CMI based on circ-ITCH were successfully predicted.The data and source code can be found at https://github.com/1axin/BioDGW-CMI-model.
科研通智能强力驱动
Strongly Powered by AbleSci AI