三肽
计算生物学
二肽基肽酶
杠杆(统计)
计算机科学
肽
抑制性突触后电位
酶
人工智能
化学
生物化学
生物
神经科学
作者
Changge Guan,Juan Luo,Shucheng Li,Zheng Lin Tan,Jiahao Li,Zourun Wu,Yi Wang,Haihong Chen,Naoyuki Yamamoto,Chong Zhang,Junjie Chen,Xin‐Hui Xing
标识
DOI:10.1101/2022.06.13.495896
摘要
Abstract Mining of anti-diabetic dipeptidyl peptidase IV (DPP-IV) inhibitory peptides (DPP-IV-IPs) is currently a costly and laborious process. Due to the absence of rational peptide design rules, it relies on cumbersome screening of unknown enzyme hydrolysates. Here, we present an enhanced deep learning (DL) model called BERT-DPPIV, specifically designed to classify DPP-IV-IPs and exploring their design rules to discover potent candidates. The end-to-end model utilizes a fine-tuned bidirectional encoder representations (BERT) architecture to extract structural/functional information from input peptides and accurately identify DPP-IV-Ips from input peptides. Experimental results in benchmark dataset showed BERT-DPPIV yielded state-of-the-art accuracy of 0.894, surpassing the 0.797 obtained by sequence-feature model. Furthermore, we leverage the attention mechanism to uncover that our model could recognize restriction enzyme cutting site and specific residues that contribute to the inhibition of DPP-IV. Moreover, guided by BERT-DPPIV, proposed design rules of DPP-IV inhibitory tripeptides and pentapeptides were validated and they can be used to screen potent DPP-IV-IPs.
科研通智能强力驱动
Strongly Powered by AbleSci AI