Biology
Genetics
Haplotype
Genome-wide association study
Minor allele frequency
Lung cancer
SNP
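Allele
Genetic association
Single-nucleotide polymorphism
Allele frequency
Imputation (statistics)
Oncology
Gene
Genotype
Medicine
Machine learning
Computer science
Missing data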
Authors
Cheng Wang, Juncheng Dai, Na Qin, Jingyi Fan, Hongxia Ma, Congcong Chen, Mingxing An, Jing Zhang, Caiwang Yan, Yayun Gu, Yuan Xie, Yuanlin He, Yue Jiang, Meng Zhu, Ci Song, Tao Jiang, Jia Liu, Jun Zhou, Nanxi Wang, Tingting Hua
Source
Journal: Cancer Cell
[Cell Press]
Date: 2022-09-15
Volume (issue): 40 (10), pages 1223-1239.e6
Citations: 39
Identifier
DOI: 10.1016/j.ccell.2022.08.013
Abstract
We present the largest whole-genome sequencing (WGS) study of non-small cell lung cancer (NSCLC) to date among 6,004 individuals of Chinese ancestry, coupled with 23,049 individuals genotyped by SNP array. We construct a high-quality haplotype reference panel for imputation and identify 20 common and low-frequency loci (minor allele frequency [MAF] ≥ 0.5%), including five loci that have never been reported before. For rare loss-of-function (LoF) variants (MAF < 0.5%), we identify BRCA2 and 18 other cancer predisposition genes that affect 5.29% of individuals with NSCLC, and 98.91% (181 of 183) of LoF variants have not been linked previously to NSCLC risk. Promoter variants of BRCA2 also have a substantial effect on NSCLC risk, and their prevalence is comparable with BRCA2 LoF variants. The associations are validated in an independent case-control study including 4,410 individuals and a prospective cohort study including 23,826 individuals. Our findings not only provide a high-quality reference panel for future array-based association studies but depict the whole picture of rare pathogenic variants for NSCLC.