启发式
计算机科学
相似性(几何)
吞吐量
算法
管道(软件)
灵敏度(控制系统)
组分(热力学)
自由序列分析
仿射变换
序列比对
生物
数据挖掘
理论计算机科学
人工智能
数学
程序设计语言
图像(数学)
电信
生物化学
物理
电子工程
基因
纯数学
肽序列
热力学
无线
工程类
作者
Kunhyung Bahk,Joohon Sung
摘要
Abstract In biological sequence alignment, prevailing heuristic aligners achieve high-throughput by several approximation techniques, but at the cost of sacrificing the clarity of output criteria and creating complex parameter spaces. To surmount these challenges, we introduce ‘SigAlign’, a novel alignment algorithm that employs two explicit cutoffs for the results: minimum length and maximum penalty per length, alongside three affine gap penalties. Comparative analyses of SigAlign against leading database search tools (BLASTn, MMseqs2) and read mappers (BWA-MEM, bowtie2, HISAT2, minimap2) highlight its performance in read mapping and database searches. Our research demonstrates that SigAlign not only provides high sensitivity with a non-heuristic approach, but also surpasses the throughput of existing heuristic aligners, particularly for high-accuracy reads or genomes with few repetitive regions. As an open-source library, SigAlign is poised to become a foundational component to provide a transparent and customizable alignment process to new analytical algorithms, tools and pipelines in bioinformatics.
科研通智能强力驱动
Strongly Powered by AbleSci AI