计算机科学
计算生物学
编码(社会科学)
致病性
生物
数学
微生物学
统计
作者
Yijia Chen,Yiwen Chen,Shanling Nie,Hai Yang
标识
DOI:10.1109/bibm58861.2023.10385413
摘要
Genomic variants, which can disrupt cellular functions, present a challenge in distinguishing deleterious from benign instances. While assessing genome-wide functional impacts, many current algorithms neglect protein tertiary structure of coding region variants due to limitations in protein structural prediction. This study introduces PMMVar, an advanced multimodal deep convolutional network, which adeptly integrates protein tertiary structures with conservation properties from ESM-2, supplemented by other protein structural sequences. PMMVar achieves outstanding performance on the latest clinical variant datasets, NCBI ClinVar (2023), and the Mendelian variant dataset, surpassing existing benchmarks. Ablation analyses validate the significance of protein multi-level structures in enhancing the model's accuracy. Overall, our findings spotlight the essential role of multi-level protein structures in pathogenicity predictions and their potential to discern deleterious genomic variants effectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI