Piwi相互作用RNA
计算机科学
机制(生物学)
计算生物学
人工智能
深度学习
序列(生物学)
嵌入
假结
生物
核糖核酸
遗传学
RNA干扰
基因
哲学
认识论
作者
Yajun Liu,Yulian Ding,Aimin Li,Rong Fei,Guo Xie,Fang‐Xiang Wu
标识
DOI:10.1109/bibm55620.2022.9995306
摘要
PIWI-interacting RNAs (piRNAs) are a type of small non-coding RNAs which bind with the PIWI proteins to exert biological effects in various regulatory mechanisms. A growing amount of evidence reveals that exosomal piRNAs are potential biomarkers for diagnosis and treatment of complex diseases. Effective methods for the prediction of exosomal piRNAs are the foundation of piRNA functional research. In this study, we propose an end-to-end deep network for identifying exosomal piRNAs based on features learned from natural language processing (NLP) models for sequence embedding with attention mechanism. First, a benchmark dataset is constructed by processing piRNA subcellular localization annotated data and sequence data. Moreover, bagging positive unlabeled learning is applied to get the reliable negative set. Finally, we treat a piRNA sequence as a sentence and its k-mer subsequence as a token. Sequence embedding models with self-attention mechanism is designed to extract features from exosome piRNA sequences, which are used for the prediction task. Compared with three competing methods, our model achieves the best performance and reveals the key factors of exosomal piRNA sequences by the attention mechanism. Our model characterizes exosomal piRNAs and could be beneficial for researchers to investigate exosomal piRNAs’ functions.
科研通智能强力驱动
Strongly Powered by AbleSci AI