深度学习
计算机科学
人工智能
启发式
卷积神经网络
编码
基因分型
基因组学
计算生物学
机器学习
基因组
模式识别(心理学)
生物
基因型
遗传学
基因
操作系统
作者
Victoria Popic,Chris Rohlicek,Fabio Cunial,Iman Hajirasouliha,Dmitry Meleshko,Kiran Garimella,Anant Maheshwari
出处
期刊:Nature Methods
[Springer Nature]
日期:2023-03-23
卷期号:20 (4): 559-568
被引量:22
标识
DOI:10.1038/s41592-023-01799-x
摘要
Structural variants (SVs) are a major driver of genetic diversity and disease in the human genome and their discovery is imperative to advances in precision medicine. Existing SV callers rely on hand-engineered features and heuristics to model SVs, which cannot scale to the vast diversity of SVs nor fully harness the information available in sequencing datasets. Here we propose an extensible deep-learning framework, Cue, to call and genotype SVs that can learn complex SV abstractions directly from the data. At a high level, Cue converts alignments to images that encode SV-informative signals and uses a stacked hourglass convolutional neural network to predict the type, genotype and genomic locus of the SVs captured in each image. We show that Cue outperforms the state of the art in the detection of several classes of SVs on synthetic and real short-read data and that it can be easily extended to other sequencing platforms, while achieving competitive performance. Cue achieves versatile and performant structural variant calling and genotyping using a deep-learning approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI