鉴定(生物学)
计算生物学
计算机科学
水准点(测量)
注释
人工神经网络
一般化
人工智能
源代码
机器学习
生物
程序设计语言
数学
植物
数学分析
地理
大地测量学
作者
Bowen Li,Guanxiang Liang
标识
DOI:10.1101/2023.12.29.573676
摘要
Abstract Bacteriophages, also known as phages, are essential for the stability of the microbiome system due to their ability to infect prokaryotes, another significant component of the microbiome. Thus, understanding the functions of phage proteins could help us unravel the nature of phages and their roles in the microbiome. However, limited by the low throughput of experimental techniques, a vast number of phage proteins remain unannotated in terms of their functions. Computational methods are expected to solve this restriction due to their high throughput and cost-effectiveness. In this study, we focused on one aspect of functional annotation for phage proteins, the identification and classification of phage virion proteins, and the integration of a large pretrained protein language model and an MLP neural network dramatically improved the performance of these two tasks. Additionally, we compared our model with some previous deep learning models using a newly collected, independent benchmark dataset, demonstrating the strong generalization ability of our model for both tasks. The source codes of ESM-PVP and the software for the PVP identification task have been uploaded to: https://github.com/li-bw18/ESM-PVP .
科研通智能强力驱动
Strongly Powered by AbleSci AI