Tumour purity as an underlying key factor in tumour mutation detection in colorectal cancer

结直肠癌 医学 肿瘤科 钥匙(锁) 突变 癌症 内科学 癌症研究 计算机科学 基因 遗传学 生物 计算机安全
作者
Tao Yu,Qianpeng Huang,Xinyu Zhao,Shiyao Zhang,Qi Zhang,Xingcan Fan,Gang Liu
出处
期刊:Clinical and translational medicine [Springer Science+Business Media]
卷期号:13 (5)
标识
DOI:10.1002/ctm2.1252
摘要

The emergence of next-generation sequencing (NGS) technology has enabled the large-scale identification of personalised genetic characteristics of colorectal cancer (CRC).1 However, the accuracy may be influenced by certain sample factors, such as sampling methods, biospecimen type (fresh vs. formalin-fixed paraffin-embedded) and input DNA amount.2-4 We creatively performed a contrastive analysis based on homogenous paired real-world surgical tumour specimens to comprehensively assess the impact of low tumour cell fraction on the authenticity of somatic mutation calling. Initially, we identified the correlation between the genomic mutation profile called by MuTect2 and the corresponding tumour purity from three public datasets: The Cancer Genome Atlas (TCGA) (n = 535),5 MSK-IMPACT (n = 941)6 and MSK-MetTropist (n = 3470).7 Samples with low tumour purity were widespread in real-world NGS datasets (Figure 1A). The quantity of mutations and tumour purity had a favourable correlation (Figure 1B). Similar results were found in TCGA called by MuSE, SomaticSniper and VarScan2 (Figure S1A). To verify the impact of tumour cell fraction on the variant allele frequency (VAF) of mutated genes, we classified the database samples into high-fraction and low-fraction groups based on the median tumour cell fraction. The VAF of majority of common hotspot variants in high tumour cell fraction samples displayed considerably higher or positive correlational trends than those in the low tumour cell fraction samples (Figure 1C). Similar results were also observed when the variants were called using MuTect2, MuSE and VarScan2 algorithms in TCGA (Figure S1B). These findings suggested that the number and VAF of variants may be significantly underestimated in low tumour cell fraction samples. Then, we systematically evaluated the impact of tumour cell fraction on the fidelity of NGS with 30 surgical specimens using a targeted NGS platform including exon of 437 cancer-associated genes and intron of 62 genes where fusion usually happens (1.53 Mb). The detailed clinicopathological parameters of the patient cohort are shown in Table S1. Paired serial-sectioned samples after tumour purity assessment were alternately divided into precise-sampling and routine-sampling groups according to the sampling sequence. Precise scratching sampling for tumour-specific tissue was performed in precise-sampling groups so as to improve the tumour purity (Figure 2A) (Supplementary Method 1). The clinical–genomic features of 30 paired CRC samples with precision sampling and routine sampling are summarised in Figure 2B. A total of 250 mutations were private to the precise-sampling group, 23 mutations were private to the routine-sampling group, and 439 mutations were shared mutations (Figure S3). The distribution of variants under changes in tumour purity are shown in Figure 2C. The number of mutations after precise sampling was significantly increased in low tumour purity group (Figure 3D). The VAFs of common hotspot mutations were also considerably increased in precise-sampling group (Figure 3E). There was an increasing trend in the number of genes with copy number variations (CNVs) after precise sampling, and the copy number values of genes in the precise-sampling group changed obviously compared with those in the routine-sampling group (Figure 3F). Tumour mutational burden was also underestimated in low-purity sample group (Figure 3G). To further rule out an influence of pathological factors on mutation detection, we performed subgroup analyses according to location, staging and differentiation of tumour and similar outcomes were confirmed (Figure S4). These findings showed that pathological parameters had no effect on the bias influence of tumour purity on mutation detection. We further identified the optimal tumour purity threshold for calling mutations, and we regarded precise-sampling private mutations as false-negative mutations (FNMs) and routine-sampling private mutations as false-positive mutations (FPMs). The proportion of patients with FNMs gradually reduced as tumour purity increased, while the proportion of patients with FPMs was relatively small and showed no correlation with tumour purity (Figure 3A). We evaluated the accuracy of mutation detection using the F-score (Supplementary Method 2). The accuracy of mutation detection increased as the tumour purity increased (Figure 3A). We further investigated the reason for the poor accuracy of mutation detection in low tumour cell fraction samples by analysing the association between the false-negative/-positive rate and tumour purity. The false-negative rate of the samples decreased as tumour purity increased, while the false-positive rate was not significantly correlated with tumour purity (Figure 3B). The variants were then described and displayed based on their VAF. When the tumour purity was <30%, there were many FNMs with high-VAF variants. The number of FNMs decreased significantly when the tumour purity was >30%, and most of these were low-VAF variants (Figure 3C). In contrast, there was no connection between the quantity of FPMs and tumour cell fraction, and the detected FPMs were very low-VAF variants (Figure 3D). Using case 20 as an example, we assessed the impact of tumour cell fraction. on mutation detection with whole exome sequencing (WES). The tumour purity of routine samples evaluated by pathologist and WES were 22.5% and 25%, respectively. The tumour purity was 100% after precise sampling. When compared to the routine sample, the precise sample had more genes with single nucleotide variants (SNVs) and indels (Figure 4A). The number and extent of CNVs that were amplified or deleted increased after precise sampling. The minor allele frequency (MAF) distribution preference and number of heterozygosity deletions were also underestimated in the low-purity samples (Figure 4B). Due to many variants with low VAF were found, the number of subclones inferred after precise sampling was eight as opposed to two in routine sample (Figure 4C). Similar results were also observed in case 22 (Figure S5). Low tumour purity may affect the evaluation of mutation spectra and signatures, while cluster analysis with known mutation characteristics showed that differences in these factors did not affect the explanation of the carcinogenic mechanism (Figure S6). More drivers and target genes were detected after precise sampling (Figure S7). In conclusion, we unveil that tumour purity acts as an independent and significant influencing factor and should be taken into consideration when evaluating genomic characterisation using NGS detection in CRC. Above 30% of tumour purity might be suitable for clinical applications in precision oncology and a higher tumour fraction could promote the accuracy of WES for assessing mutational and clonal landscapes. We thank Susan Furness, PhD, from Liwen Bianji (Edanz) (www.liwenbianji.cn) for editing the English text of a draft of this manuscript. The authors declare they have no conflicts of interest. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
等待雪瑶发布了新的文献求助10
刚刚
2秒前
何东浩发布了新的文献求助10
2秒前
3秒前
4秒前
彩色的奄完成签到,获得积分10
4秒前
科研通AI5应助powozhi13579采纳,获得10
4秒前
5秒前
博修完成签到,获得积分10
6秒前
wyj0815应助子云采纳,获得10
6秒前
科研通AI5应助shijie805采纳,获得10
7秒前
spenley发布了新的文献求助10
7秒前
7秒前
88发布了新的文献求助10
8秒前
Sean完成签到,获得积分10
8秒前
enchanted完成签到 ,获得积分10
9秒前
perfect完成签到 ,获得积分10
9秒前
林林完成签到,获得积分10
10秒前
善良的远锋完成签到,获得积分10
11秒前
a1423072381发布了新的文献求助10
12秒前
传奇3应助666采纳,获得10
12秒前
eliot完成签到,获得积分10
14秒前
任性的咖啡完成签到,获得积分10
14秒前
spenley完成签到,获得积分10
15秒前
19秒前
wind完成签到,获得积分10
20秒前
20秒前
良辰应助木子采纳,获得10
21秒前
科研通AI2S应助木子采纳,获得10
21秒前
田様应助木子采纳,获得10
21秒前
galeno完成签到 ,获得积分10
22秒前
23秒前
水水发布了新的文献求助10
23秒前
powozhi13579发布了新的文献求助10
24秒前
26秒前
CC完成签到,获得积分10
27秒前
LYNN完成签到,获得积分10
28秒前
微笑念瑶完成签到,获得积分10
30秒前
88完成签到,获得积分20
30秒前
31秒前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2700
Neuromuscular and Electrodiagnostic Medicine Board Review 1000
こんなに痛いのにどうして「なんでもない」と医者にいわれてしまうのでしょうか 510
The First Nuclear Era: The Life and Times of a Technological Fixer 500
Unusual formation of 4-diazo-3-nitriminopyrazoles upon acid nitration of pyrazolo[3,4-d][1,2,3]triazoles 500
岡本唐貴自伝的回想画集 500
Distinct Aggregation Behaviors and Rheological Responses of Two Terminally Functionalized Polyisoprenes with Different Quadruple Hydrogen Bonding Motifs 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3671635
求助须知:如何正确求助?哪些是违规求助? 3228335
关于积分的说明 9779690
捐赠科研通 2938645
什么是DOI,文献DOI怎么找? 1610206
邀请新用户注册赠送积分活动 760547
科研通“疑难数据库(出版商)”最低求助积分说明 736093