卷积神经网络
集合(抽象数据类型)
计算机科学
人工智能
肽
机器学习
序列(生物学)
人工神经网络
蛋白质-蛋白质相互作用
计算生物学
化学
生物
生物化学
程序设计语言
作者
Van-The Le,Zijun Zhan,Thi-Thu-Phuong Vu,Muhammad Shahid Malik,Yu‐Yen Ou
标识
DOI:10.1016/j.jmgm.2024.108777
摘要
This study delves into the prediction of protein-peptide interactions using advanced machine learning techniques, comparing models such as sequence-based, standard CNNs, and traditional classifiers. Leveraging pre-trained language models and multi-view window scanning CNNs, our approach yields significant improvements, with ProtTrans standing out based on 2.1 billion protein sequences and 393 billion amino acids. The integrated model demonstrates remarkable performance, achieving an AUC of 0.856 and 0.823 on the PepBCL Set_1 and Set_2 datasets, respectively. Additionally, it attains a Precision of 0.564 in PepBCL Set 1 and 0.527 in PepBCL Set 2, surpassing the performance of previous methods. Beyond this, we explore the application of this model in cancer therapy, particularly in identifying peptide interactions for selective targeting of cancer cells, and other fields. The findings of this study contribute to bioinformatics, providing valuable insights for drug discovery and therapeutic development.
科研通智能强力驱动
Strongly Powered by AbleSci AI