内在无序蛋白质
球状蛋白
折叠(DSP实现)
蛋白质折叠
计算机科学
计算生物学
蛋白质组学
纳米技术
化学
生物物理学
生物
材料科学
结晶学
工程类
生物化学
基因
电气工程
作者
Jennifer Stone,Habib Boloorchi Tabrizi,Rittika Shamsuddin
标识
DOI:10.1109/bibm58861.2023.10385640
摘要
While advances have been made in the study of globular proteins, proteomics research has yet to fully address the unique realm of intrinsically disordered proteins (IDPs). In terms of structural and functional dynamics, IDPs differ from their globular counterparts. This article presents two major contributions: (1) the development of extensive IDP protein datasets for enhanced training, and (2) the enhancement of ProteinBERT’s predictive capabilities to also include IDPs along with globular proteins. As a result of the refinement of our model, we have emphasized: 1) homology predictions, which are important in light of protein sequence evolution; 2) the possibility of secondary structures; and 3) the determination of folding classes. By fine-tuning ProteinBERT, prediction accuracy is markedly improved, highlighting the importance of diversified datasets in advancing proteomic machine learning.
科研通智能强力驱动
Strongly Powered by AbleSci AI