生物
表达数量性状基因座
连锁不平衡
遗传学
变化(天文学)
数量性状位点
单倍型
结构变异
联动装置(软件)
人口
遗传变异
选择性拼接
RNA剪接
基因型
进化生物学
计算生物学
基因
单核苷酸多态性
外显子
基因组
天体物理学
物理
核糖核酸
人口学
社会学
作者
Alexander S. Leonard,Xena Marie Mapel,Hubert Pausch
出处
期刊:Genome Research
[Cold Spring Harbor Laboratory]
日期:2024-02-14
卷期号:: gr.278267.123-gr.278267.123
被引量:3
标识
DOI:10.1101/gr.278267.123
摘要
Expression and splicing quantitative trait loci (e/sQTL) are large contributors to phenotypic variability. Achieving sufficient statistical power for e/sQTL mapping requires large cohorts with both genotypes and molecular phenotypes, and so the genomic variation is often called from short-read alignments which are unable to comprehensively resolve structural variation. Here we build a pangenome from 16 HiFi haplotype-resolved assemblies to identify small and structural variation and genotype them with PanGenie in 307 short-read samples. We find high (>90%) concordance of PanGenie-genotyped and DeepVariant-called small variation, and confidently genotype close to 21M small and 43k structural variants in the larger population. We validate 85% of these structural variants (with MAF>0.1) directly with a subset of 25 short-read samples that also have medium coverage HiFi reads. We then conduct e/sQTL mapping with this comprehensive variant set in a subset of 117 cattle that have testis transcriptome data and find 92 structural variants as causal candidates for eQTL and 73 for sQTL. We find that roughly half of top associated structural variants affecting expression or splicing are transposable elements, such as SV-eQTLs for STN1 and MYH7 and SV-sQTLs for CEP89 and ASAH2 . Extensive linkage disequilibrium between small and structural variation results in only 28 additional eQTL and 17 sQTL discovered when including SVs, although many top associated SVs are compelling candidates.
科研通智能强力驱动
Strongly Powered by AbleSci AI