作者
Ran Li,Mian Gong,Xinmiao Zhang,Fei Wang,Zhenyu Liu,Lei Zhang,Mengsi Xu,Yunfeng Zhang,Xuelei Dai,Zhuangbiao Zhang,Wenwen Fang,Yifan Yang,Huan‐Huan Zhang,Weiwei Fu,Chunna Cao,Peng Yang,Zeinab Amiri Ghanatsaman,Niloufar Jafarpour Negari,Hojjat Asadollahpour Nanaei,Xiangpeng Yue,Yuxuan Song,Xianyong Lan,Weidong Deng,Xihong Wang,Ruidong Xiang,Eveline M. Ibeagha‐Awemu,Pat Heslop-Harrison,Johannes A. Lenstra,Shangquan Gan,Yu Jiang
摘要
Abstract Structural variations (SVs) are a major contributor to genetic diversity and phenotypic variations, but their prevalence and functions in domestic animals are largely unexplored. Here, we assembled 26 haplotype-resolved genome assemblies from 13 genetically diverse sheep using PacBio HiFi sequencing. We constructed a graph-based ovine pan-genome and discovered 142,422 biallelic insertions and deletions, 7,028 divergent alleles and 13,419 multiallelic variations. We then used a graph-based approach to genotype the biallelic SVs in 684 individuals from 45 domestic breeds and two wild species. Integration with RNA-seq data allows to identify candidate expression-associated SVs. We demonstrate a direct link of SVs and phenotypes by localizing the putative causative insertion in HOXB13 gene responsible for the long-tail trait and identifying multiple large SVs associated with the fat-tail. Beyond generating a benchmark resource for ovine structural variants, our study highlights that animal genetic research will greatly benefit from using a pan-genome graph rather than a single reference genome.