清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

New Method for Sequence Similarity Analysis Based on the Position and Frequency of Statistically Significant Repeats

相似性(几何) 序列(生物学) 算法 职位(财务) 序列标志 序列分析 聚类分析 序列比对 模式识别(心理学) 核酸序列 数学 等级制度 计算生物学 计算机科学 人工智能 共识序列 生物 遗传学 DNA 基序列 肽序列 图像(数学) 基因 财务 经济 市场经济
作者
Jasmina Jovanović
出处
期刊:Current Bioinformatics [Bentham Science Publishers]
卷期号:16 (10): 1299-1310
标识
DOI:10.2174/1574893616999210805165628
摘要

Background: The analysis of DNA nucleotide sequence similarity among different species is crucial in identifying their functional, structural or evolutionary relationships. The number of bioinformatics tools designed to perform the similarity analysis of nucleotide sequences has been growing rapidly. According to the current literature, alignment-free methods have not been performed on repetitive nucleotide sequence of different lengths. Objective: To develop a new algorithm for determining sequence characteristics and similarity based on statistically significant repetitive elements of different lengths, which are located in analyzed sequences. Methods: This paper presents Repeats-Position/Frequency method (R-P/F method), for determining nucleotide sequence similarity which takes into consideration statistically significant repetitive parts of analyzed sequences. It is based on information theory and the fact that both position and frequency of repeated sequences are not expected to occur with the identical presence in a random sequence of the same length. Nucleotide sequences are presented in rn-dimensional vector space and their hierarchy is constructed by applying hierarchical clustering algorithm. Results: R-P/F method has been validated on multiple data sets of nucleotide sequences and compared with results obtained from alignment-based algorithms BLAST and Clustal Omega, and multiple wellestablished alignment-free dissimilarity measures. Presented method provides results comparable with other commonly used methods focused on resolving the same problem, with the novel view on the used repetitive parts of sequences in these calculations. Conclusion: The presented, novel algorithm for calculating sequence similarity measure is effective in discovering relationships among the sequences and makes a powerful and complementary addition to existing sequence similarity methods.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
luckkit完成签到 ,获得积分10
2秒前
10秒前
cugwzr完成签到,获得积分10
14秒前
xuesensu完成签到 ,获得积分10
14秒前
自觉石头完成签到 ,获得积分10
19秒前
小苔藓完成签到 ,获得积分10
25秒前
心静自然好完成签到 ,获得积分10
25秒前
满意的寒凝完成签到 ,获得积分10
50秒前
57秒前
fabea完成签到,获得积分10
1分钟前
Pan完成签到,获得积分10
1分钟前
酷酷小子完成签到 ,获得积分10
1分钟前
1分钟前
kenchilie完成签到 ,获得积分10
1分钟前
小静完成签到 ,获得积分10
1分钟前
啥时候吃火锅完成签到 ,获得积分0
1分钟前
1分钟前
高大觅夏完成签到 ,获得积分10
1分钟前
快乐随心完成签到 ,获得积分10
2分钟前
2分钟前
liuliu完成签到 ,获得积分10
2分钟前
喜悦香萱完成签到 ,获得积分10
2分钟前
2分钟前
研友_V8RDYn完成签到,获得积分10
2分钟前
开心的大娘完成签到,获得积分10
2分钟前
lili完成签到 ,获得积分10
2分钟前
2分钟前
3分钟前
3分钟前
Raymond完成签到,获得积分10
3分钟前
南风完成签到 ,获得积分10
3分钟前
番茄小超人2号完成签到 ,获得积分10
3分钟前
小小王完成签到 ,获得积分10
4分钟前
CHEN完成签到 ,获得积分10
4分钟前
轩辕剑身完成签到,获得积分10
4分钟前
4分钟前
mmm完成签到 ,获得积分10
4分钟前
沉沉完成签到 ,获得积分0
4分钟前
xiaoblue完成签到,获得积分10
4分钟前
4分钟前
高分求助中
All the Birds of the World 4000
Production Logging: Theoretical and Interpretive Elements 3000
Animal Physiology 2000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Am Rande der Geschichte : mein Leben in China / Ruth Weiss 1500
CENTRAL BOOKS: A BRIEF HISTORY 1939 TO 1999 by Dave Cope 1000
Machine Learning Methods in Geoscience 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3736714
求助须知:如何正确求助?哪些是违规求助? 3280668
关于积分的说明 10020218
捐赠科研通 2997394
什么是DOI,文献DOI怎么找? 1644527
邀请新用户注册赠送积分活动 782060
科研通“疑难数据库(出版商)”最低求助积分说明 749656