Trait Imputation Enhances Nonlinear Genetic Prediction for Some Traits

生物 特质 插补(统计学) 遗传学 数量性状位点 进化生物学 统计 基因 数学 缺少数据 计算机科学 程序设计语言
作者
Ruoyu He,Jinwen Fu,Jingchen Ren,Wei Pan
出处
期刊:Genetics [Oxford University Press]
标识
DOI:10.1093/genetics/iyae148
摘要

Abstract The expansive collection of genetic and phenotypic data within biobanks offers an unprecedented opportunity for biomedical research. However, the frequent occurrence of missing phenotypes presents a significant barrier to fully leveraging this potential. In our target application, on one hand, we have only a small and complete dataset with both genotypes and phenotypes to build a genetic prediction model, commonly called a polygenic (risk) score (PGS or PRS); on the other hand, we have a large dataset of genotypes (e.g. from a biobank) without the phenotype of interest. Our goal is to leverage the large dataset of genotypes (but without the phenotype) and a separate genome-wide association studies summary dataset of the phenotype to impute the phenotypes, which are then used as an individual-level dataset, along with the small complete dataset, to build a nonlinear model as PGS. More specifically, we trained some nonlinear models to 7 imputed and observed phenotypes from the UK Biobank data. We then trained an ensemble model to integrate these models for each trait, resulting in higher R2 values in prediction than using only the small complete (observed) dataset. Additionally, for 2 of the 7 traits, we observed that the nonlinear model trained with the imputed traits had higher R2 than using the imputed traits directly as the PGS, while for the remaining 5 traits, no improvement was found. These findings demonstrate the potential of leveraging existing genetic data and accounting for nonlinear genetic relationships to improve prediction accuracy for some traits.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
4秒前
晨纯完成签到 ,获得积分10
6秒前
医者学也完成签到,获得积分10
11秒前
麦乐迪完成签到 ,获得积分10
11秒前
13秒前
空白格完成签到 ,获得积分10
18秒前
20秒前
桐桐应助zjw采纳,获得30
24秒前
隐形曼青应助zhangsenbing采纳,获得10
26秒前
27秒前
荣幸完成签到 ,获得积分10
28秒前
陈蓥完成签到 ,获得积分10
31秒前
40秒前
40秒前
46秒前
49秒前
zhangsenbing发布了新的文献求助10
50秒前
拟态橙完成签到 ,获得积分10
53秒前
57秒前
LLX123完成签到 ,获得积分10
1分钟前
犹豫代曼发布了新的文献求助10
1分钟前
1分钟前
高山流水完成签到 ,获得积分10
1分钟前
1分钟前
lxcy0612发布了新的文献求助10
1分钟前
1分钟前
1分钟前
zjw发布了新的文献求助30
1分钟前
倷倷完成签到 ,获得积分10
1分钟前
负责雨安完成签到 ,获得积分10
1分钟前
1分钟前
LIJIngcan完成签到 ,获得积分10
1分钟前
阿桔完成签到 ,获得积分10
1分钟前
忧心的藏鸟完成签到 ,获得积分10
1分钟前
帮我下一下完成签到,获得积分10
1分钟前
我是老大应助zhangsenbing采纳,获得10
1分钟前
xzx完成签到 ,获得积分10
2分钟前
五月完成签到 ,获得积分10
2分钟前
宁静致远QY完成签到,获得积分10
2分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Various Faces of Animal Metaphor in English and Polish 800
Signals, Systems, and Signal Processing 610
Adverse weather effects on bus ridership 500
Photodetectors: From Ultraviolet to Infrared 500
On the Dragon Seas, a sailor's adventures in the far east 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6350671
求助须知:如何正确求助?哪些是违规求助? 8165288
关于积分的说明 17182091
捐赠科研通 5406866
什么是DOI,文献DOI怎么找? 2862727
邀请新用户注册赠送积分活动 1840290
关于科研通互助平台的介绍 1689463