m6Aminer: Predicting the m6Am Sites on mRNA by Fusing Multiple Sequence-Derived Features into a CatBoost-Based Classifier

计算机科学 分类器(UML) 模式识别(心理学) 人工智能 特征(语言学) 特征提取 数据挖掘 计算生物学 生物 哲学 语言学
作者
Ze Liu,Pengfei Lan,Ting Liu,Xudong Li,Tao Liu
出处
期刊:International Journal of Molecular Sciences [MDPI AG]
卷期号:24 (9): 7878-7878 被引量:1
标识
DOI:10.3390/ijms24097878
摘要

As one of the most important post-transcriptional modifications, m6Am plays a fairly important role in conferring mRNA stability and in the progression of cancers. The accurate identification of the m6Am sites is critical for explaining its biological significance and developing its application in the medical field. However, conventional experimental approaches are time-consuming and expensive, making them unsuitable for the large-scale identification of the m6Am sites. To address this challenge, we exploit a CatBoost-based method, m6Aminer, to identify the m6Am sites on mRNA. For feature extraction, nine different feature-encoding schemes (pseudo electron-ion interaction potential, hash decimal conversion method, dinucleotide binary encoding, nucleotide chemical properties, pseudo k-tuple composition, dinucleotide numerical mapping, K monomeric units, series correlation pseudo trinucleotide composition, and K-spaced nucleotide pair frequency) were utilized to form the initial feature space. To obtain the optimized feature subset, the ExtraTreesClassifier algorithm was adopted to perform feature importance ranking, and the top 300 features were selected as the optimal feature subset. With different performance assessment methods, 10-fold cross-validation and independent test, m6Aminer achieved average AUC of 0.913 and 0.754, demonstrating a competitive performance with the state-of-the-art models m6AmPred (0.905 and 0.735) and DLm6Am (0.897 and 0.730). The prediction model developed in this study can be used to identify the m6Am sites in the whole transcriptome, laying a foundation for the functional research of m6Am.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
3秒前
3秒前
liu完成签到,获得积分20
5秒前
研友_nV2ROn完成签到,获得积分10
5秒前
cdliuchao发布了新的文献求助10
8秒前
liang19640908完成签到 ,获得积分10
9秒前
Hysen_L完成签到,获得积分10
12秒前
眼睛大盼兰完成签到 ,获得积分10
13秒前
Severan发布了新的文献求助10
15秒前
Wayne72完成签到,获得积分10
16秒前
IMFINE完成签到 ,获得积分10
17秒前
18秒前
19秒前
22秒前
22秒前
25秒前
张瀚文发布了新的文献求助10
25秒前
东木完成签到,获得积分10
25秒前
26秒前
30秒前
CAOHOU举报han求助涉嫌违规
31秒前
cdliuchao发布了新的文献求助10
32秒前
Owen应助柏含卉采纳,获得10
32秒前
姜茶完成签到 ,获得积分0
34秒前
团团完成签到 ,获得积分10
34秒前
Fung完成签到,获得积分10
35秒前
36秒前
36秒前
拼搏绿柏完成签到,获得积分10
39秒前
科研通AI2S应助坚强的严青采纳,获得10
39秒前
枯藤老树昏呀完成签到,获得积分10
40秒前
42秒前
43秒前
43秒前
cdliuchao完成签到,获得积分10
44秒前
搞怪的语风完成签到,获得积分10
48秒前
tz发布了新的文献求助10
48秒前
50秒前
太叔振家完成签到,获得积分10
51秒前
53秒前
高分求助中
Tracking and Data Fusion: A Handbook of Algorithms 1000
Models of Teaching(The 10th Edition,第10版!)《教学模式》(第10版!) 800
La décision juridictionnelle 800
Rechtsphilosophie und Rechtstheorie 800
Academic entitlement: Adapting the equity preference questionnaire for a university setting 500
Full waveform acoustic data processing 400
Bounded Meaning 400
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2877806
求助须知:如何正确求助?哪些是违规求助? 2491276
关于积分的说明 6743830
捐赠科研通 2172672
什么是DOI,文献DOI怎么找? 1154612
版权声明 586096
科研通“疑难数据库(出版商)”最低求助积分说明 566823