蛋白质结构
情态动词
计算生物学
计算机科学
生物
化学
生物化学
高分子化学
作者
Sheng Chen,Sen Zhang,Xiaoyu Fang,Liang Lin,Huiying Zhao,Yuedong Yang
标识
DOI:10.1038/s41467-024-53116-5
摘要
Cryo-electron microscopy (cryo-EM) technique is widely used for protein structure determination. Current automatic cryo-EM protein complex modeling methods mostly rely on prior chain separation. However, chain separation without sequence guidance often suffers from errors caused by cross-chain interaction or noise densities, which would accumulate and mislead the subsequent steps. Here, we present EModelX, a fully automated cryo-EM protein complex structure modeling method, which achieves sequence-guiding modeling through cross-modal alignments between cryo-EM maps and protein sequences. EModelX first employs multi-task deep learning to predict Cα atoms, backbone atoms, and amino acid types from cryo-EM maps, which is subsequently used to sample Cα traces with amino acid profiles. The profiles are then aligned with protein sequences to obtain initial structural models, which yielded an average RMSD of 1.17 Å in our test set, approaching atomic-level precision in recovering PDB-deposited structures. After filling unmodeled gaps through sequence-guiding Cα threading, the final models achieved an average TM-score of 0.808, outperforming the state-of-the-art method. The further combination with AlphaFold can improve the average TM-score to 0.911. Analyzes conducted by comparing some EModelX-built models and PDB structures highlight its potential to improve PDB structures. EModelX is accessible at https://bio-web1.nscc-gz.cn/app/EModelX . Here the authors report EModelX, a fully automated method for modeling protein structures using cryoEM data. By aligning cryo-EM maps with protein sequences through deep learning, it achieves near-atomic precision and outperforms existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI