生命银行
插补(统计学)
人口
生物
遗传学
基因型
人类遗传变异
生物沉积
等位基因
基因组
人类基因组
缺少数据
医学
基因
计算机科学
环境卫生
机器学习
作者
Clare Bycroft,Colin Freeman,Desislava Petkova,Gavin Band,Lloyd T. Elliott,Kevin Sharp,Allan Motyer,Damjan Vukcevic,Olivier Delaneau,Jared O’Connell,Adrián Cortés,Samantha Welsh,A. P. Young,Mark Effingham,Gil McVean,Stephen Leslie,Naomi E. Allen,Peter Donnelly,Jonathan Marchini
出处
期刊:Nature
[Springer Nature]
日期:2018-10-01
卷期号:562 (7726): 203-209
被引量:6695
标识
DOI:10.1038/s41586-018-0579-z
摘要
The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases. Deep phenotype and genome-wide genetic data from 500,000 individuals from the UK Biobank, describing population structure and relatedness in the cohort, and imputation to increase the number of testable variants to 96 million.
科研通智能强力驱动
Strongly Powered by AbleSci AI