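Context (archaeology)
Representation (politics)
Computer science
Artificial intelligence
Enzyme
Sequence (biology)
Machine learning
Feature (linguistics)
Degradation (telecommunications)
Pattern recognition (psychology)
Biology
Biochemistry
Paleontology
Philosophy
Politics
Law
Telecommunications
Linguistics
Political science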
Authors
Renjing Jiang, Lanyu Shang, Ruohan Wang, Dong Wang, Na Wei
Source
Journal: Environmental Science and Technology Letters
[American Chemical Society]
Date: 2023-06-19
Volume (issue): pages: 10 (7): 557-564
Citations: 12
Identifier
DOI: 10.1021/acs.estlett.3c00293
Abstract
Enzyme biocatalysis for plastic treatment and recycling is an emerging field of growing interest. However, it is challenging and time-consuming to identify plastic-degrading enzymes with desirable functionality, given the large number of putative enzyme sequences. There is a critical need to develop an effective approach to accurately predict the enzyme activity in degrading different types of plastics. In this study, we developed a machine-learning-based plastic enzymatic degradation (PED) framework to predict the ability of an enzyme to degrade plastics of interest by exploring and recognizing hidden patterns in protein sequences. A data set integrating information from a wide range of experimentally verified enzymes and various common plastic substrates was created. A new context-aware enzyme sequence representation (CESR) mechanism was developed to learn the abundant contextual information in enzyme sequences, and feature extraction was performed for enzymes at both the amino acid level and global sequence level. Thirteen machine learning classification algorithms were compared, and XGBoost was identified as the best-performing algorithm. PED achieved an overall accuracy of 90.2% and outperformed sequence-based protein classification models from the existing literature. Furthermore, important enzyme features in plastic degradation were identified and comprehensively interpreted. This study demonstrated a new tool for the prediction and discovery of plastic-degrading enzymes.
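The abstract mentions feature extraction at both the amino acid level and the global sequence level. As a rough illustration only, the sketch below computes two generic sequence descriptors in plain Python: overall amino acid composition (a global feature) and overlapping k-mer counts (a simple stand-in for local context). This is not the CESR mechanism described in the paper; the function names, the choice of k, and the toy sequence are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids, in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Global-level feature: fraction of each standard amino acid in the sequence."""
    seq = seq.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    return [counts[a] / total if total else 0.0 for a in AMINO_ACIDS]


def kmer_counts(seq, k=2):
    """Local-context feature: counts of overlapping k-mers of standard residues."""
    seq = seq.upper()
    kmers = (seq[i:i + k] for i in range(len(seq) - k + 1))
    return Counter(m for m in kmers if all(ch in AMINO_ACIDS for ch in m))


def featurize(seq, k=2):
    """Bundle both feature views for one sequence (hypothetical helper)."""
    return {"composition": composition(seq), "kmers": kmer_counts(seq, k)}


# Toy sequence for demonstration; it is not a real enzyme.
features = featurize("MKVLAAGA")
```

Feature vectors of this kind could then be passed to a classifier such as XGBoost, which the abstract reports as the best-performing of the thirteen algorithms compared.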