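Signal peptide
Computer science
Artificial intelligence
Computational biology
Genome
Machine learning
Deep learning
Pipeline (software)
Training set
Organism
Natural language processing
Pattern recognition (psychology)
Peptide sequence
Biology
Biochemistry
Gene
Genetics
Programming language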
Authors
Junbo Shen,Qinze Yu,Shenyang Chen,Qingxiong Tan,Jingchen Li,Yu Li
Identifier
DOI:10.1038/s43588-023-00576-2
Abstract
A signal peptide (SP) is a short peptide located at the N-terminus of a protein. It is essential for targeting and transferring transmembrane and secreted proteins to their correct locations. Compared with traditional experimental methods for identifying signal peptides, computational methods are faster and more efficient, which makes them more practical for analyzing thousands or even millions of protein sequences, especially in metagenomic data. Here we present the Unbiased Organism-agnostic Signal Peptide Network (USPNet), a deep learning method for signal peptide classification and cleavage site prediction that takes advantage of protein language models. We propose applying label distribution-aware margin loss to handle data imbalance, and using evolutionary information from proteins to enrich the representations and remove the dependence on species information.
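To make the label distribution-aware margin (LDAM) loss mentioned in the abstract more concrete, the following is a minimal sketch of its general form: each class gets a margin proportional to n_j^(-1/4), where n_j is the number of training examples in class j, and that margin is subtracted from the true-class logit before the usual softmax cross-entropy. This is not USPNet's implementation. The class counts, the `max_margin` value, and the function names `ldam_margins` and `ldam_loss` are assumptions chosen for illustration only.

```python
import math


def ldam_margins(class_counts, max_margin=0.5):
    """Per-class margins proportional to n_j ** -0.25.

    The margins are rescaled so that the rarest class receives max_margin
    (max_margin is an illustrative value, not taken from the paper).
    """
    raw = [n ** -0.25 for n in class_counts]
    scale = max_margin / max(raw)
    return [r * scale for r in raw]


def ldam_loss(logits, label, margins):
    """LDAM loss for a single example.

    Subtracts the true class's margin from its logit, then computes
    softmax cross-entropy with a numerically stable log-sum-exp.
    """
    adjusted = list(logits)
    adjusted[label] -= margins[label]
    peak = max(adjusted)
    log_sum_exp = peak + math.log(sum(math.exp(z - peak) for z in adjusted))
    return log_sum_exp - adjusted[label]


# Hypothetical, heavily imbalanced class counts (not from the paper).
class_counts = [10000, 1000, 100]
margins = ldam_margins(class_counts)

# The rarest class (index 2) gets the largest margin, so a rare-class example
# is penalized more than under plain cross-entropy with the same logits.
example_logits = [2.0, 1.0, 0.0]
plain_ce = ldam_loss(example_logits, 2, [0.0, 0.0, 0.0])
ldam = ldam_loss(example_logits, 2, margins)
```

Setting all margins to zero reduces the function to ordinary cross-entropy, which makes the effect of the margin easy to compare: larger margins on rare classes push the model to separate them more strongly during training.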