卷积神经网络
联营
人工智能
模式识别(心理学)
计算机科学
判别式
特征(语言学)
深度学习
序列(生物学)
机器学习
生物
遗传学
哲学
语言学
作者
Trinh‐Trung‐Duong Nguyen,Syun Chen,Quang‐Thai Ho,Yu‐Yen Ou
出处
期刊:Proteins
[Wiley]
日期:2022-03-12
卷期号:90 (7): 1486-1492
被引量:3
摘要
Protein multiple sequence alignment information has long been important features to know about functions of proteins inferred from related sequences with known functions. It is therefore one of the underlying ideas of Alpha fold 2, a breakthrough study and model for the prediction of three-dimensional structures of proteins from their primary sequence. Our study used protein multiple sequence alignment information in the form of position-specific scoring matrices as input. We also refined the use of a convolutional neural network, a well-known deep-learning architecture with impressive achievement on image and image-like data. Specifically, we revisited the study of prediction of adenosine triphosphate (ATP)-binding sites with more efficient convolutional neural networks. We applied multiple convolutional window scanning filters of a convolutional neural network on position-specific scoring matrices for as much as useful information as possible. Furthermore, only the most specific motifs are retained at each feature map output through the one-max pooling layer before going to the next layer. We assumed that this way could help us retain the most conserved motifs which are discriminative information for prediction. Our experiment results show that a convolutional neural network with not too many convolutional layers can be enough to extract the conserved information of proteins, which leads to higher performance. Our best prediction models were obtained after examining them with different hyper-parameters. Our experiment results showed that our models were superior to traditional use of convolutional neural networks on the same datasets as well as other machine-learning classification algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI