计算机科学
蛋白质组学
特征提取
深度学习
人工智能
特征(语言学)
蛋白质测序
序列(生物学)
模式识别(心理学)
计算生物学
肽序列
化学
生物
生物化学
语言学
哲学
基因
作者
Chaoyang Liu,Bo Wang,Xinhong Zhang,Fan Zhang
摘要
Proteins secreted by various cells and tissues into different body fluids can signal several physiological disorders. However, the degree of complexity in different body fluids and the presence of a large number of proteins in fluids can make studying them with existing proteomics techniques complex and result in large discrepancies between experimental studies. To address this, we developed a deep learning framework called SecBert that identifies secreted proteins in two human body fluids. SecBert uses a sequence-based approach with end-to-end automatic feature extraction for protein classification. Our results show that SecBert performs well, achieving an average area under the ROC curve of 0.94-0.95 on each fluid test dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI