SAPPHIRE: A stacking-based ensemble learning framework for accurate prediction of thermophilic proteins

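Sapphire; Computer science; Identification (biology); Artificial intelligence; Feature (linguistics); Machine learning; Set (abstract data type); Baseline (sea); Data mining; Algorithm; Oceanography; Optics; Physics; Geology; Philosophy; Biology; Botany; Programming language; Laser; Linguistics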

Authors

Phasit Charoenkwan, Nalini Schaduangrat, Mohammad Ali Moni, Pietro Lió, Balachandran Manavalan, Watshara Shoombuatong

Source

Journal: Computers in Biology and Medicine [Elsevier]
Date: 2022-06-07; Volume: 146, article 105704; Citations: 40

Identifier

DOI: 10.1016/j.compbiomed.2022.105704

Abstract

Thermophilic proteins (TPPs) are important in protein biochemistry and in the development of new enzymes, so computational methods that identify them accurately and rapidly are needed. Several computational methods have been developed for TPP identification; however, some limitations in performance and utility remain. In this study, we present a novel computational method, SAPPHIRE, that identifies TPPs more accurately using only sequence information, with no need for structural information. We combined twelve feature encodings representing different perspectives with six popular machine learning algorithms to train 72 baseline models and extract the key information of TPPs. The informative predicted probabilities from the baseline models were then mined and selected using a genetic algorithm in conjunction with a self-assessment-report approach. Finally, the meta-predictor, SAPPHIRE, was built and optimized using the optimal feature set. In the 10-fold cross-validation test, SAPPHIRE outperformed several baseline models. Moreover, in the independent test, SAPPHIRE achieved an accuracy of 0.942 and a Matthews correlation coefficient of 0.884, which were 7.68% and 5.12% higher, respectively, than those of existing methods. The proposed approach is anticipated to facilitate large-scale identification of TPPs and accelerate their applications in the food industry. The code and datasets are available at https://github.com/plenoi/SAPPHIRE.
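The stacking layout described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation (that is in the repository linked above). Everything here is an assumption made for illustration: the two toy encodings `aac` and `group_fractions` stand in for the twelve encodings, the hypothetical `CentroidModel` with two settings stands in for the six algorithms, the sequences are synthetic, and the genetic-algorithm feature selection step is omitted. It shows only how each (encoding, learner) pair yields one baseline model, how out-of-fold probabilities become meta-features, and how accuracy and the Matthews correlation coefficient are computed.

```python
import math
import random
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Two toy encodings (hypothetical stand-ins for the twelve in the abstract).
def aac(seq):
    """Amino acid composition: relative frequency of each of the 20 residues."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]

def group_fractions(seq):
    """Fractions of charged (DEKR) and hydrophobic (AILMFVW) residues."""
    return [sum(seq.count(a) for a in grp) / len(seq) for grp in ("DEKR", "AILMFVW")]

class CentroidModel:
    """Toy nearest-centroid scorer (a stand-in for the six algorithms)."""

    def __init__(self, sharpness):
        self.sharpness = sharpness  # steepness of the logistic squashing

    def fit(self, X, y):
        def mean(rows):
            return [sum(col) / len(rows) for col in zip(*rows)]
        self.pos = mean([x for x, t in zip(X, y) if t == 1])
        self.neg = mean([x for x, t in zip(X, y) if t == 0])
        return self

    def predict_proba(self, x):
        # Closer to the positive centroid than the negative one -> probability > 0.5.
        margin = math.dist(x, self.neg) - math.dist(x, self.pos)
        return 1.0 / (1.0 + math.exp(-self.sharpness * margin))

def out_of_fold_probs(make_model, X, y, k=5):
    """Score every sample with a model trained on the other k-1 folds."""
    probs = [0.0] * len(X)
    for fold in range(k):
        train = [i for i in range(len(X)) if i % k != fold]
        model = make_model().fit([X[i] for i in train], [y[i] for i in train])
        for i in range(fold, len(X), k):
            probs[i] = model.predict_proba(X[i])
    return probs

def accuracy_and_mcc(y_true, y_pred):
    """Accuracy and Matthews correlation coefficient from binary labels."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return (tp + tn) / len(y_true), mcc

def toy_sequence(rng, thermo, length=60):
    # Assumption for the synthetic data only: positives are enriched in E, K, R.
    weights = [3.0 if (thermo and a in "EKR") else 1.0 for a in AMINO_ACIDS]
    return "".join(rng.choices(AMINO_ACIDS, weights=weights, k=length))

def run_demo(seed=0):
    rng = random.Random(seed)
    labels = [i % 2 for i in range(120)]
    seqs = [toy_sequence(rng, t) for t in labels]
    train_s, train_y, test_s, test_y = seqs[:80], labels[:80], seqs[80:], labels[80:]

    # Every (encoding, learner setting) pair is one baseline model: 2 x 2 = 4 here.
    baselines = list(product([aac, group_fractions], [5.0, 50.0]))
    train_cols, test_cols = [], []
    for encode, sharpness in baselines:
        X_tr = [encode(s) for s in train_s]
        X_te = [encode(s) for s in test_s]
        make = lambda sharpness=sharpness: CentroidModel(sharpness)
        train_cols.append(out_of_fold_probs(make, X_tr, train_y))
        full = make().fit(X_tr, train_y)
        test_cols.append([full.predict_proba(x) for x in X_te])

    # Rows of baseline probabilities become the meta-predictor's feature vectors.
    train_meta = [list(row) for row in zip(*train_cols)]
    test_meta = [list(row) for row in zip(*test_cols)]
    meta = CentroidModel(10.0).fit(train_meta, train_y)
    preds = [int(meta.predict_proba(x) >= 0.5) for x in test_meta]
    return len(baselines), accuracy_and_mcc(test_y, preds)

if __name__ == "__main__":
    n_models, (acc, mcc) = run_demo()
    print(f"{n_models} baseline models, accuracy={acc:.3f}, MCC={mcc:.3f}")
```

With twelve encodings and six algorithms, the same `product` call would enumerate the 72 baseline models mentioned in the abstract. Training the meta-predictor on out-of-fold probabilities, rather than on probabilities from models that saw the same samples, is a common way to limit leakage in stacking; whether SAPPHIRE does exactly this is not stated in the abstract.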
