支持向量机
人工智能
计算机科学
分类器(UML)
水准点(测量)
模式识别(心理学)
模糊逻辑
机器学习
数据挖掘
核(代数)
数学
地理
大地测量学
组合数学
作者
Yi Zou,Hongjie Wu,Xiaoyi Guo,Peng Li,Yijie Ding,Jijun Tang,Fei Guo
出处
期刊:Current Bioinformatics
[Bentham Science]
日期:2020-06-07
卷期号:16 (2): 274-283
被引量:56
标识
DOI:10.2174/1574893615999200607173829
摘要
Background: Detecting DNA-binding proteins (DBPs) based on biological and chemical methods is time-consuming and expensive. Objective: In recent years, the rise of computational biology methods based on Machine Learning (ML) has greatly improved the detection efficiency of DBPs. Method: In this study, the Multiple Kernel-based Fuzzy SVM Model with Support Vector Data Description (MK-FSVM-SVDD) is proposed to predict DBPs. Firstly, sex features are extracted from the protein sequence. Secondly, multiple kernels are constructed via these sequence features. Then, multiple kernels are integrated by Centered Kernel Alignment-based Multiple Kernel Learning (CKA-MKL). Next, fuzzy membership scores of training samples are calculated with Support Vector Data Description (SVDD). FSVM is trained and employed to detect new DBPs. Results: Our model is evaluated on several benchmark datasets. Compared with other methods, MKFSVM- SVDD achieves best Matthew's Correlation Coefficient (MCC) on PDB186 (0.7250) and PDB2272 (0.5476). Conclusion: We can conclude that MK-FSVM-SVDD is more suitable than common SVM, as the classifier for DNA-binding proteins identification.
科研通智能强力驱动
Strongly Powered by AbleSci AI