计算机科学
卷积神经网络
人工智能
深度学习
基因组学
基因组
转座因子
人工神经网络
机器学习
模式识别(心理学)
生物
生物化学
基因
作者
Simón Orozco-Arias,Luis Humberto López-Murillo,Johan S. Piña,Estiven Valencia-Castrillon,Reinel Tabares-Soto,Luis Fernando Castillo Ossa,Gustavo Isaza,Romain Guyot
出处
期刊:PLOS ONE
[Public Library of Science]
日期:2023-09-21
卷期号:18 (9): e0291925-e0291925
被引量:1
标识
DOI:10.1371/journal.pone.0291925
摘要
Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO .
科研通智能强力驱动
Strongly Powered by AbleSci AI