The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features

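Keywords: Biology, Genome, Contig, Genetics, Haplotype, Synteny, Retrotransposon, Gene, Bacterial artificial chromosome, Sequence assembly, Ploidy, Genomics, Computational biology
Authors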
Weihong Qi,Yi-Wen Lim,Andrea Patrignani,Pascal Schläpfer,Anna Bratus-Neuenschwander,Simon Grüter,Christelle Chanez,Nathalie Rodde,Elisa Prat,Sonia Vautrin,Margaux-Alison Fustier,Diogo Pratas,Ralph Schlapbach,Wilhelm Gruissem
Source
Journal: GigaScience [University of Oxford]
Volume/Issue: 11
Identifier
DOI:10.1093/gigascience/giac028
Abstract

Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome.

Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20-30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding.

The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.
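
For readers unfamiliar with the assembly metrics quoted in the abstract, the following is a minimal sketch of how contig N50 and a Phred-scaled consensus quality value (QV) are conventionally defined. It is not the authors' pipeline: the function names `n50` and `qv_to_error_rate` and the toy contig lengths are illustrative assumptions only.

```python
def n50(lengths):
    """Return N50: the contig length at which the cumulative length of
    contigs, taken from longest to shortest, first reaches at least half
    of the total assembly length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contigs supplied")


def qv_to_error_rate(qv):
    """Convert a Phred-scaled quality value to a per-base error
    probability using the standard relation p = 10 ** (-QV / 10)."""
    return 10 ** (-qv / 10)


if __name__ == "__main__":
    toy_contigs = [2, 3, 4, 5, 6]      # hypothetical lengths, not real data
    print(n50(toy_contigs))            # N50 of the toy assembly
    print(qv_to_error_rate(46))        # error probability implied by QV46
```

Under the standard Phred relation, a consensus accuracy above QV46 corresponds to fewer than roughly 2.5 errors per 100,000 bases.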
