生物
DNA损伤
计算生物学
DNA
遗传学
随机森林
计算机科学
机器学习
作者
Dongfang You,Na Qin,Mingzhi Zhang,Juncheng Dai,Mulong Du,Yongyue Wei,Ruyang Zhang,Zhibin Hu,David C. Christiani,Yang Zhao,Feng Chen
出处
期刊:Gene
[Elsevier]
日期:2020-03-09
卷期号:740: 144570-144570
被引量:2
标识
DOI:10.1016/j.gene.2020.144570
摘要
Recent studies have found multiple single nucleotide variants (SNVs) associated with DNA damage. However, previous association analysis may ignore the potential interaction effects between SNVs. Therefore, we used an improved random forest (RF) analysis to identify the SNVs related to personal DNA damage in exon-focused genome-wide association study (GWAS). A total of 301 subjects from three independent centers (Zhuhai, Wuhan, and Tianjin) were retained for analysis. An improved RF procedure was used to systematically screen key SNVs associated with DNA damage. Furthermore, we used genetic risk score (GRS) and mediation analysis to investigate the integrative effect and potential mechanism of these genetic variants on DNA damage. Besides, gene set enrichment analysis was conducted to identify the pathways enriched by key SNVs using the Data-driven Expression Prioritized Integration for Complex Traits (DEPICT). Finally, a set of 24 SNVs with the lowest mean square errors (MSE) were identified by improved RF analysis. Both weighted and unweighted GRSs were associated with increased DNA damage levels (Pweight < 0.001 and Punweight < 0.001). Gene set enrichment analysis indicated that these loci were significantly enriched in several biological features associated with DNA damage. These findings suggested the role of SNVs in modifying DNA damage levels. It may be convincing that this improved RF analysis can effectively identify SNVs associated with DNA damage levels.
科研通智能强力驱动
Strongly Powered by AbleSci AI