亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

A benchmarking study of individual somatic variant callers and voting-based ensembles for whole-exome sequencing

索引 体细胞 标杆管理 外显子组测序 外显子组 计算机科学 投票 计算生物学 机器学习 生物 遗传学 突变 基因 基因型 单核苷酸多态性 政治 政治学 业务 营销 法学
作者
Arnaud Guillé,José Adélaı̈de,Pascal Finetti,Fabrice André,Daniel Birnbaum,Émilie Mamessier,François Bertucci,Max Chaffanet
出处
期刊:Briefings in Bioinformatics [Oxford University Press]
卷期号:26 (1)
标识
DOI:10.1093/bib/bbae697
摘要

Abstract By identifying somatic mutations, whole-exome sequencing (WES) has become a technology of choice for the diagnosis and guiding treatment decisions in many cancers. Despite advances in the field of somatic variant detection and the emergence of sophisticated tools incorporating machine learning, accurately identifying somatic variants remains challenging. Each new somatic variant caller is often accompanied by claims of superior performance compared to predecessors. Furthermore, most comparative studies focus on a limited set of tools and reference datasets, leading to inconsistent results and making it difficult for laboratories to select the optimal solution. Our study comprehensively evaluated 20 somatic variant callers across four reference WES datasets. We subsequently assessed the performance of ensemble approaches by exploring all possible combinations of these callers, generating 8178 and 1013 combinations for single-nucleotide variants (SNVs) and indels, respectively, with varying voting thresholds. Our analysis identified five high-performing individual somatic variant callers: Muse, Mutect2, Dragen, TNScope, and NeuSomatic. For somatic SNVs, an ensemble combining LoFreq, Muse, Mutect2, SomaticSniper, Strelka, and Lancet outperformed the top-performing caller (Dragen) by >3.6% (mean F1 score = 0.927). Similarly, for somatic indels, an ensemble of Mutect2, Strelka, Varscan2, and Pindel outperformed the best individual caller (Neusomatic) by >3.5% (mean F1 score = 0.867). By considering the computational costs of each combination, we were able to identify an optimal solution involving four somatic variant callers, Muse, Mutect2, and Strelka for the SNVs and Mutect2, Strelka, and Varscan2 for the indels, enabling accurate and cost-effective somatic variant detection in whole exome.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Chan发布了新的文献求助10
4秒前
20秒前
Suyi发布了新的文献求助10
27秒前
28秒前
28秒前
丘比特应助含蓄戾采纳,获得10
29秒前
34秒前
华仔应助Chan采纳,获得10
36秒前
39秒前
39秒前
含蓄戾完成签到,获得积分10
40秒前
NattyPoe完成签到,获得积分10
41秒前
41秒前
含蓄戾发布了新的文献求助10
43秒前
47秒前
51秒前
eosin发布了新的文献求助10
52秒前
52秒前
Chan完成签到,获得积分10
54秒前
55秒前
破碎虚空发布了新的文献求助10
56秒前
59秒前
SciGPT应助eosin采纳,获得10
1分钟前
ff发布了新的文献求助10
1分钟前
奋斗的小笼包完成签到 ,获得积分10
1分钟前
1分钟前
八田完成签到,获得积分10
1分钟前
ff发布了新的文献求助10
1分钟前
1分钟前
vicky完成签到 ,获得积分10
1分钟前
1分钟前
21完成签到,获得积分10
1分钟前
1分钟前
1分钟前
小二郎应助曹大壮采纳,获得10
1分钟前
雷鸣惊动发布了新的文献求助10
1分钟前
AllRightReserved应助Tayzon采纳,获得10
1分钟前
fanhuaxuejin发布了新的文献求助10
1分钟前
1分钟前
海绵宝宝完成签到 ,获得积分10
1分钟前
高分求助中
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
Metal–Organic Frameworks in Analytical Chemistry 400
Cybercrime: The Transformation of Crime in the Information Age, 2nd Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6609696
求助须知:如何正确求助?哪些是违规求助? 8376360
关于积分的说明 17922920
捐赠科研通 5772063
什么是DOI,文献DOI怎么找? 2957541
邀请新用户注册赠送积分活动 1932722
关于科研通互助平台的介绍 1832697