Tumour purity as an underlying key factor in tumour mutation detection in colorectal cancer

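Colorectal cancer; Medicine; Oncology; Key (lock); Mutation; Cancer; Internal medicine; Cancer research; Computer science; Gene; Genetics; Biology; Computer security
Authors
Tao Yu,Qianpeng Huang,Xinyu Zhao,Shiyao Zhang,Qi Zhang,Xingcan Fan,Gang Liu
Source
Journal: Clinical and Translational Medicine [Wiley]
Volume (issue): 13 (5)
Identifier
DOI:10.1002/ctm2.1252
Abstract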

The emergence of next-generation sequencing (NGS) technology has enabled the large-scale identification of personalised genetic characteristics of colorectal cancer (CRC).[1] However, its accuracy may be influenced by sample factors such as sampling method, biospecimen type (fresh vs. formalin-fixed paraffin-embedded) and input DNA amount.[2-4] We performed a comparative analysis of homogeneous paired real-world surgical tumour specimens to comprehensively assess the impact of low tumour cell fraction on the fidelity of somatic mutation calling. We first examined the correlation between the mutation profile called by MuTect2 and the corresponding tumour purity in three public datasets: The Cancer Genome Atlas (TCGA) (n = 535),[5] MSK-IMPACT (n = 941)[6] and MSK-MetTropist (n = 3470).[7] Samples with low tumour purity were widespread in real-world NGS datasets (Figure 1A). The number of mutations was positively correlated with tumour purity (Figure 1B). Similar results were found in TCGA when variants were called by MuSE, SomaticSniper and VarScan2 (Figure S1A). To verify the impact of tumour cell fraction on the variant allele frequency (VAF) of mutated genes, we split the database samples into high-fraction and low-fraction groups at the median tumour cell fraction. For the majority of common hotspot variants, the VAF in high tumour cell fraction samples was considerably higher, or trended higher, than in low tumour cell fraction samples (Figure 1C). Similar results were also observed when the variants were called using the MuTect2, MuSE and VarScan2 algorithms in TCGA (Figure S1B). These findings suggest that both the number and the VAF of variants may be substantially underestimated in low tumour cell fraction samples.
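The dilution of VAF in low-purity samples can be illustrated with a standard textbook relation between tumour purity and expected VAF. The following is a minimal sketch, not the authors' method: the function name `expected_vaf` and its parameters are hypothetical, and it assumes a clonal mutation, a normal-cell copy number of 2 and no sequencing error.

```python
def expected_vaf(purity, mutant_copies=1, tumour_copy_number=2, normal_copy_number=2):
    """Expected VAF of a clonal somatic mutation in a tumour/normal admixture.

    Assumption (not from the source): every tumour cell carries `mutant_copies`
    mutated alleles out of `tumour_copy_number` copies of the locus, and normal
    cells carry only reference alleles.
    """
    if not 0.0 <= purity <= 1.0:
        raise ValueError("purity must be between 0 and 1")
    mutant_alleles = purity * mutant_copies
    total_alleles = purity * tumour_copy_number + (1.0 - purity) * normal_copy_number
    return mutant_alleles / total_alleles


# A heterozygous clonal mutation in a diploid region at three purities.
for purity in (0.25, 0.50, 1.00):
    print(f"purity={purity:.2f} -> expected VAF={expected_vaf(purity):.3f}")
```

Under these assumptions the expected VAF of a heterozygous clonal variant is half the purity, so a subclonal variant in a low-purity sample can easily fall below a caller's detection limit.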
We then systematically evaluated the impact of tumour cell fraction on the fidelity of NGS in 30 surgical specimens, using a targeted NGS panel covering the exons of 437 cancer-associated genes and the introns of 62 genes in which fusions frequently occur (1.53 Mb). The detailed clinicopathological parameters of the patient cohort are shown in Table S1. After tumour purity assessment, paired serial sections were alternately assigned to precise-sampling and routine-sampling groups according to the sectioning sequence. In the precise-sampling group, tumour-specific tissue was precisely scraped to increase tumour purity (Figure 2A) (Supplementary Method 1). The clinical–genomic features of the 30 paired CRC samples with precise sampling and routine sampling are summarised in Figure 2B. A total of 250 mutations were private to the precise-sampling group, 23 mutations were private to the routine-sampling group, and 439 mutations were shared (Figure S3). The distribution of variants as tumour purity changed is shown in Figure 2C. The number of mutations increased significantly after precise sampling in the low tumour purity group (Figure 3D). The VAFs of common hotspot mutations were also considerably higher in the precise-sampling group (Figure 3E). The number of genes with copy number variations (CNVs) tended to increase after precise sampling, and the copy number values of genes in the precise-sampling group differed markedly from those in the routine-sampling group (Figure 3F). Tumour mutational burden was also underestimated in the low-purity sample group (Figure 3G). To further rule out an influence of pathological factors on mutation detection, we performed subgroup analyses according to tumour location, stage and differentiation, and similar outcomes were confirmed (Figure S4).
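The paired comparison described here amounts to set operations on the variant calls of the two samples from each patient. The sketch below uses invented toy calls, not data from the study; `partition_calls` and `tmb_per_mb` are hypothetical helper names, and computing tumour mutational burden as mutations per megabase of panel is an assumption, since the text does not state the formula used.

```python
PANEL_SIZE_MB = 1.53  # panel size reported in the text


def partition_calls(precise, routine):
    """Split two call sets into precise-private, routine-private and shared variants."""
    precise, routine = set(precise), set(routine)
    return {
        "precise_private": precise - routine,
        "routine_private": routine - precise,
        "shared": precise & routine,
    }


def tmb_per_mb(n_mutations, panel_size_mb=PANEL_SIZE_MB):
    """Mutations per megabase of sequenced panel (assumed TMB definition)."""
    return n_mutations / panel_size_mb


# Toy example: each variant is keyed by (gene, protein change).
precise_calls = {("KRAS", "p.G12D"), ("TP53", "p.R175H"), ("APC", "p.R1450*")}
routine_calls = {("KRAS", "p.G12D"), ("PIK3CA", "p.E545K")}

groups = partition_calls(precise_calls, routine_calls)
for name, variants in groups.items():
    print(name, sorted(variants))
print("TMB (precise):", round(tmb_per_mb(len(precise_calls)), 2), "mutations/Mb")
print("TMB (routine):", round(tmb_per_mb(len(routine_calls)), 2), "mutations/Mb")
```

Keying variants by a tuple keeps the comparison exact; a real pipeline would more likely key on chromosome, position, reference allele and alternate allele.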
These findings showed that pathological parameters did not alter the bias that tumour purity introduces into mutation detection. To identify the optimal tumour purity threshold for calling mutations, we regarded precise-sampling private mutations as false-negative mutations (FNMs) and routine-sampling private mutations as false-positive mutations (FPMs). The proportion of patients with FNMs gradually decreased as tumour purity increased, while the proportion of patients with FPMs was relatively small and showed no correlation with tumour purity (Figure 3A). We evaluated the accuracy of mutation detection using the F-score (Supplementary Method 2). The accuracy of mutation detection increased as tumour purity increased (Figure 3A). We further investigated the reason for the poor accuracy of mutation detection in low tumour cell fraction samples by analysing the association between the false-negative and false-positive rates and tumour purity. The false-negative rate decreased as tumour purity increased, while the false-positive rate was not significantly correlated with tumour purity (Figure 3B). The variants were then described and displayed according to their VAF. When tumour purity was <30%, there were many FNMs, including high-VAF variants. The number of FNMs decreased significantly when tumour purity was >30%, and most of the remaining FNMs were low-VAF variants (Figure 3C). In contrast, the number of FPMs was unrelated to tumour cell fraction, and the detected FPMs were very low-VAF variants (Figure 3D). Using case 20 as an example, we assessed the impact of tumour cell fraction on mutation detection with whole exome sequencing (WES). The tumour purity of the routine sample was 22.5% as evaluated by a pathologist and 25% as estimated by WES. The tumour purity was 100% after precise sampling. Compared with the routine sample, the precise sample had more genes with single nucleotide variants (SNVs) and indels (Figure 4A).
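The accuracy metric can be sketched as follows. The exact definition is deferred to Supplementary Method 2, which is not reproduced here, so this sketch assumes the standard F-score with shared mutations counted as true positives, FNMs as false negatives and FPMs as false positives. The function names are hypothetical, and the cohort-wide counts (439 shared, 250 precise-private, 23 routine-private) are used only for illustration, not to reproduce the per-sample values in Figure 3.

```python
def f_score(tp, fp, fn, beta=1.0):
    """Standard F-beta score from true-positive, false-positive and false-negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1.0 + b2) * precision * recall / (b2 * precision + recall)


def false_negative_rate(tp, fn):
    """Fraction of reference mutations missed by the routine sample."""
    return fn / (tp + fn) if (tp + fn) else 0.0


# Assumed mapping of the reported counts onto a confusion table.
shared, fnm, fpm = 439, 250, 23
print(f"F1 = {f_score(shared, fpm, fnm):.3f}")
print(f"false-negative rate = {false_negative_rate(shared, fnm):.3f}")
```

Because precise sampling serves as the reference, there are no true negatives in this setting, which is why the sketch reports a false-negative rate but does not attempt a conventional false-positive rate.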
The number and extent of amplified or deleted CNVs increased after precise sampling. The minor allele frequency (MAF) distribution preference and the number of heterozygous deletions were also underestimated in the low-purity sample (Figure 4B). Because many low-VAF variants were detected, eight subclones were inferred after precise sampling, as opposed to two in the routine sample (Figure 4C). Similar results were also observed in case 22 (Figure S5). Low tumour purity may affect the evaluation of mutation spectra and signatures, although cluster analysis with known mutational characteristics showed that these differences did not alter the interpretation of the carcinogenic mechanism (Figure S6). More driver and target genes were detected after precise sampling (Figure S7). In conclusion, we show that tumour purity is an independent and significant factor that should be taken into consideration when evaluating genomic characteristics by NGS in CRC. A tumour purity above 30% might be suitable for clinical applications in precision oncology, and a higher tumour fraction could improve the accuracy of WES for assessing mutational and clonal landscapes. We thank Susan Furness, PhD, from Liwen Bianji (Edanz) (www.liwenbianji.cn) for editing the English text of a draft of this manuscript. The authors declare they have no conflicts of interest.