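Metric (unit)
Correctness
Benchmark (surveying)
Set (abstract data type)
Statistical
Contiguity
Genome
Computer science
Quality (philosophy)
Biology
Completeness (order theory)
Computational biology
Data mining
Mathematics
Algorithm
Genetics
Gene
Engineering
Statistics
Operations management
Philosophy
Mathematical analysis
Operating system
Epistemology
Programming language
Geography
Geodesy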
Identifier
DOI:10.1016/j.tig.2022.10.005
Abstract
Quality control is essential for genome assemblies; however, a consensus has yet to be reached on what metrics should be adopted for the evaluation of assembly quality. N50 is widely used for contiguity measurement, but its effectiveness is constantly in question. Prevailing metrics for the completeness evaluation focus on gene space, yet challenging areas such as tandem repeats are commonly overlooked. Achieving correctness has become an indispensable dimension for quality control, while prevailing assembly releases lack scores reflecting this aspect. We propose a metric set with a set of statistic indexes for effective, comprehensive evaluation of assemblies and provide a score of a finished assembly for each metric, which can be utilized as a benchmark for achieving high-quality genome assemblies.