错义突变
计算机科学
联营
机器学习
人工智能
人工神经网络
计算生物学
人口
PTEN公司
生物
遗传学
突变
基因
社会学
人口学
细胞凋亡
PI3K/AKT/mTOR通路
作者
Haicang Zhang,Michelle S. Xu,Wendy K. Chung,Yufeng Shen
标识
DOI:10.1101/2021.04.22.441037
摘要
Abstract Accurate prediction of damaging missense variants is critically important for interpreting genome sequence. While many methods have been developed, their performance has been limited. Recent progress in machine learning and availability of large-scale population genomic sequencing data provide new opportunities to significantly improve computational predictions. Here we describe gMVP, a new method based on graph attention neural networks. Its main component is a graph with nodes capturing predictive features of amino acids and edges weighted by coevolution strength, which enables effective pooling of information from local protein context and functionally correlated distal positions. Evaluated by deep mutational scan data, gMVP outperforms published methods in identifying damaging variants in TP53, PTEN, BRCA1 , and MSH2 . Additionally, it achieves the best separation of de novo missense variants in neurodevelopmental disorder cases from the ones in controls. Finally, the model supports transfer learning to optimize gain- and loss-of-function predictions in sodium and calcium channels. In summary, we demonstrate that gMVP can improve interpretation of missense variants in clinical testing and genetic studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI