已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

An Effective Algorithm Based on Sequence and Property Information for N4-methylcytosine Identification in Multiple Species

鉴定(生物学) 序列(生物学) 5-甲基胞嘧啶 财产(哲学) 化学 算法 计算生物学 计算机科学 生物化学 生物 基因 DNA甲基化 植物 基因表达 哲学 认识论
作者
Lichao Zhang,Xueting Wang,Kang Xiao,Liang Kong
出处
期刊:Letters in Organic Chemistry [Bentham Science]
卷期号:21 (8): 695-706
标识
DOI:10.2174/0115701786277281231228093405
摘要

Abstract: N4-methylcytosine (4mC) is one of the most important epigenetic modifications, which plays a significant role in biological progress and helps explain biological functions. Although biological experiments can identify potential 4mC sites, they are limited due to the experimental environment and labor-intensive process. Therefore, it is crucial to construct a computational model to identify the 4mC sites. Some computational methods have been proposed to identify the 4mC sites, but some problems should not be ignored, such as those presented as follows: (1) a more accurate algorithm is required to improve the prediction, especially for Matthew’s correlation coefficient (MCC); (2) easier method is needed for clinical research to design medicine or treat disease. Considering these aspects, an effective algorithm using comprehensible encoding in multiple species was proposed in this study. Since nucleotide arrangement and its property information could reflect the sequence structure and function, several feature vectors have been developed based on nucleotide energy information, trinucleotide energy information, and nucleotide chemical property information. Besides, feature effect has been analyzed to select the optimal feature vectors for multiple species. Finally, the optimal feature vectors were inputted into the CatBoost algorithm to construct the identification model. The evaluation results showed that our study obtained the highest MCC, i.e., 2.5%~11.1%, 1.4%~17.8%, 1.1%~7.6%, and 2.3%~18.0% higher than previous models for the A. thaliana, C. elegans, D. melanogaster, and E. coli datasets, respectively. These satisfactory results reflect that the proposed method is available to identify 4mC sites in multiple species, especially for MCC. It could provide a reasonable supplement for biological research.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
zeee发布了新的文献求助20
1秒前
岁岁完成签到,获得积分10
2秒前
2秒前
平常溪流发布了新的文献求助10
3秒前
青年才俊发布了新的文献求助30
5秒前
5秒前
9秒前
血茗完成签到 ,获得积分10
10秒前
10秒前
回忆告白发布了新的文献求助10
10秒前
orixero应助ordin采纳,获得10
10秒前
ding应助科研通管家采纳,获得10
12秒前
共享精神应助科研通管家采纳,获得10
12秒前
情怀应助科研通管家采纳,获得10
12秒前
酷波er应助科研通管家采纳,获得10
12秒前
科研通AI2S应助科研通管家采纳,获得10
12秒前
12秒前
知足关注了科研通微信公众号
13秒前
14秒前
mmyhn发布了新的文献求助10
15秒前
努力的宁发布了新的文献求助10
16秒前
传奇3应助sahjdkah采纳,获得10
17秒前
zeee完成签到,获得积分10
18秒前
19秒前
科研狗发布了新的文献求助10
19秒前
21秒前
青年才俊发布了新的文献求助30
21秒前
哎呀妈呀发布了新的文献求助10
22秒前
23秒前
ordin发布了新的文献求助10
24秒前
fifteen发布了新的文献求助30
25秒前
25秒前
25秒前
BGdream完成签到,获得积分10
26秒前
26秒前
完美世界应助南桉采纳,获得10
27秒前
在水一方发布了新的文献求助10
28秒前
30秒前
sahjdkah发布了新的文献求助10
31秒前
Chai发布了新的文献求助10
31秒前
高分求助中
Exploring Mitochondrial Autophagy Dysregulation in Osteosarcoma: Its Implications for Prognosis and Targeted Therapy 4000
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
Evolution 1100
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Research Methods for Sports Studies 1000
Gerard de Lairesse : an artist between stage and studio 670
T/CAB 0344-2024 重组人源化胶原蛋白内毒素去除方法 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 内科学 物理 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 免疫学 病理 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 2979915
求助须知:如何正确求助?哪些是违规求助? 2641053
关于积分的说明 7123480
捐赠科研通 2273759
什么是DOI,文献DOI怎么找? 1206130
版权声明 591942
科研通“疑难数据库(出版商)”最低求助积分说明 589460