效应器
背景(考古学)
计算机科学
标杆管理
选择(遗传算法)
集合预报
机器学习
集成学习
预测建模
分泌物
特征选择
特征(语言学)
计算生物学
人工智能
数据挖掘
生物
语言学
哲学
营销
业务
细胞生物学
古生物学
生物化学
作者
Yi An,Jiawei Wang,Chen Li,André Leier,Tatiana T. Marquez‐Lago,Jonathan J. Wilksch,Yang Zhang,Geoffrey I. Webb,Jiangning Song,Trevor Lithgow
摘要
Bacterial effector proteins secreted by various protein secretion systems play crucial roles in host–pathogen interactions. In this context, computational tools capable of accurately predicting effector proteins of the various types of bacterial secretion systems are highly desirable. Existing computational approaches use different machine learning (ML) techniques and heterogeneous features derived from protein sequences and/or structural information. These predictors differ not only in terms of the used ML methods but also with respect to the used curated data sets, the features selection and their prediction performance. Here, we provide a comprehensive survey and benchmarking of currently available tools for the prediction of effector proteins of bacterial types III, IV and VI secretion systems (T3SS, T4SS and T6SS, respectively). We review core algorithms, feature selection techniques, tool availability and applicability and evaluate the prediction performance based on carefully curated independent test data sets. In an effort to improve predictive performance, we constructed three ensemble models based on ML algorithms by integrating the output of all individual predictors reviewed. Our benchmarks demonstrate that these ensemble models outperform all the reviewed tools for the prediction of effector proteins of T3SS and T4SS. The webserver of the proposed ensemble methods for T3SS and T4SS effector protein prediction is freely available at http://tbooster.erc.monash.edu/index.jsp. We anticipate that this survey will serve as a useful guide for interested users and that the new ensemble predictors will stimulate research into host–pathogen relationships and inspiration for the development of new bioinformatics tools for predicting effector proteins of T3SS, T4SS and T6SS.
科研通智能强力驱动
Strongly Powered by AbleSci AI