计算机科学
Piwi相互作用RNA
自编码
人工智能
水准点(测量)
试验装置
集合(抽象数据类型)
嵌入
机器学习
数据挖掘
计算生物学
模式识别(心理学)
人工神经网络
基因组
生物
基因
遗传学
转座因子
大地测量学
程序设计语言
地理
作者
Yajun Liu,Fan Zhang,Yulian Ding,Rong Fei,Junhuai Li,Fang‐Xiang Wu
摘要
Abstract PIWI‐interacting RNAs (piRNAs) are a typical class of small non‐coding RNAs, which are essential for gene regulation, genome stability and so on. Accumulating studies have revealed that piRNAs have significant potential as biomarkers and therapeutic targets for a variety of diseases. However current computational methods face the challenge in effectively capturing piRNA‐disease associations (PDAs) from limited data. In this study, we propose a novel method, MRDPDA, for predicting PDAs based on limited data from multiple sources. Specifically, MRDPDA integrates a deep factorization machine (deepFM) model with regularizations derived from multiple yet limited datasets, utilizing separate Laplacians instead of a simple average similarity network. Moreover, a unified objective function to combine embedding loss about similarities is proposed to ensure that the embedding is suitable for the prediction task. In addition, a balanced benchmark dataset based on piRPheno is constructed and a deep autoencoder is applied for creating reliable negative set from the unlabeled dataset. Compared with three latest methods, MRDPDA achieves the best performance on the pirpheno dataset in terms of the five‐fold cross validation test and independent test set, and case studies further demonstrate the effectiveness of MRDPDA.
科研通智能强力驱动
Strongly Powered by AbleSci AI