计算机科学
特征(语言学)
人工智能
编码器
源代码
编码(集合论)
特征提取
构造(python库)
机器学习
过程(计算)
代表(政治)
变压器
模式识别(心理学)
数据挖掘
工程类
政治学
操作系统
语言学
法学
程序设计语言
集合(抽象数据类型)
电压
政治
哲学
电气工程
作者
Rui Wang,Zhecheng Zhou,Xiaonan Wu,Xin Jiang,Linlin Zhuo,Mingzhe Liu,Hao Li,Xiangzheng Fu,Xiaojun Yao
标识
DOI:10.1021/acs.jcim.3c00868
摘要
Plant small secretory peptides (SSPs) play an important role in the regulation of biological processes in plants. Accurately predicting SSPs enables efficient exploration of their functions. Traditional experimental verification methods are very reliable and accurate, but they require expensive equipment and a lot of time. The method of machine learning speeds up the prediction process of SSPs, but the instability of feature extraction will also lead to further limitations of this type of method. Therefore, this paper proposes a new feature-correction-based model for SSP recognition in plants, abbreviated as SE-SSP. The model mainly includes the following three advantages: First, the use of transformer encoders can better reveal implicit features. Second, design a feature correction module suitable for sequences, named 2-D SENET, to adaptively adjust the features to obtain a more robust feature representation. Third, stack multiple linear modules to further dig out the deep information on the sample. At the same time, the training based on a contrastive learning strategy can alleviate the problem of sparse samples. We construct experiments on publicly available data sets, and the results verify that our model shows an excellent performance. The proposed model can be used as a convenient and effective SSP prediction tool in the future. Our data and code are publicly available at https://github.com/wrab12/SE-SSP/.
科研通智能强力驱动
Strongly Powered by AbleSci AI