突变率
智人
种系突变
突变
生物
生殖系
杠杆(统计)
遗传学
基因组
计算生物学
基因组学
黑腹果蝇
计算机科学
人工智能
基因
社会学
人类学
作者
Yiyuan Fang,Shuyi Deng,Cai Li
标识
DOI:10.1101/2021.10.25.465689
摘要
Abstract Germline mutation rates are essential for genetic and evolutionary analyses. Yet, estimating accurate fine-scale mutation rates across the genome is a great challenge, due to relatively few observed mutations and intricate relationships between predictors and mutation rates. Here we present MuRaL ( Mu tation Ra te L earner), a deep learning framework to predict mutation rates at the nucleotide level using only genomic sequences as input. Harnessing human germline variants for comprehensive assessment, we show that MuRaL achieves better predictive performance than current state-of-the-art methods. Moreover, MuRaL can build models with relatively few training mutations and a moderate number of sequenced individuals, and can leverage transfer learning to further reduce data and time demands. We apply MuRaL to produce genome-wide mutation rate maps for four representative species - Homo sapiens, Macaca mulatta, Arabidopsis thaliana and Drosophila melanogaster , demonstrating its high applicability. As an example, we use improved mutation rate estimates to stratify human genes into distinct groups which are enriched for different functions, and highlight that many developmental genes are subject to high mutational burden. The open-source software and generated mutation rate maps can greatly facilitate related research.
科研通智能强力驱动
Strongly Powered by AbleSci AI