Calling large indels in 1047 Arabidopsis with IndelEnsembler

计算生物学 拟南芥 基因 突变体
作者
Dongxu Liu,Ramesh Rajaby,Lulu Wei,Lei Zhang,Zhiquan Yang,Qingyong Yang,Wing-Kin Sung
出处
期刊:Nucleic Acids Research [Oxford University Press]
卷期号:49 (19): 10879-10894 被引量:9
标识
DOI:10.1093/nar/gkab904
摘要

Large indels greatly impact the observable phenotypes in different organisms including plants and human. Hence, extracting large indels with high precision and sensitivity is important. Here, we developed IndelEnsembler to detect large indels in 1047 Arabidopsis whole-genome sequencing data. IndelEnsembler identified 34 093 deletions, 12 913 tandem duplications and 9773 insertions. Our large indel dataset was more comprehensive and accurate compared with the previous dataset of AthCNV (1). We captured nearly twice of the ground truth deletions and on average 27% more ground truth duplications compared with AthCNV, though our dataset has less number of large indels compared with AthCNV. Our large indels were positively correlated with transposon elements across the Arabidopsis genome. The non-homologous recombination events were the major formation mechanism of deletions in Arabidopsis genome. The Neighbor joining (NJ) tree constructed based on IndelEnsembler's deletions clearly divided the geographic subgroups of 1047 Arabidopsis. More importantly, our large indels represent a previously unassessed source of genetic variation. Approximately 49% of the deletions have low linkage disequilibrium (LD) with surrounding single nucleotide polymorphisms. Some of them could affect trait performance. For instance, using deletion-based genome-wide association study (DEL-GWAS), the accessions containing a 182-bp deletion in AT1G11520 had delayed flowering time and all accessions in north Sweden had the 182-bp deletion. We also found the accessions with 65-bp deletion in the first exon of AT4G00650 (FRI) flowered earlier than those without it. These two deletions cannot be detected in AthCNV and, interestingly, they do not co-occur in any Arabidopsis thaliana accession. By SNP-GWAS, surrounding SNPs of these two deletions do not correlate with flowering time. This example demonstrated that existing large indel datasets miss phenotypic variations and our large indel dataset filled in the gap.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Eleven完成签到,获得积分10
2秒前
hoy完成签到 ,获得积分10
3秒前
MMMMM完成签到,获得积分10
6秒前
liu完成签到,获得积分10
7秒前
14秒前
欢呼的傲霜完成签到 ,获得积分10
16秒前
布吉岛呀完成签到 ,获得积分10
18秒前
风清扬完成签到,获得积分0
20秒前
会飞的螃蟹完成签到,获得积分10
20秒前
共享精神应助傲慢与偏见采纳,获得10
21秒前
小爱要奋斗完成签到 ,获得积分10
21秒前
王诗琪完成签到,获得积分10
23秒前
Gamera完成签到 ,获得积分10
24秒前
DKX完成签到 ,获得积分10
26秒前
非而者厚应助xh采纳,获得10
26秒前
现代半山完成签到 ,获得积分10
27秒前
某只橘猫君完成签到,获得积分10
28秒前
wxxz完成签到,获得积分10
28秒前
hhh完成签到 ,获得积分10
29秒前
29秒前
朴素的幻然完成签到,获得积分10
32秒前
晒透发布了新的文献求助10
34秒前
Conner完成签到 ,获得积分0
34秒前
轨迹发布了新的文献求助10
35秒前
xelloss完成签到,获得积分10
36秒前
37秒前
油菜花完成签到,获得积分10
38秒前
所所应助科研通管家采纳,获得10
38秒前
洁净的酬海完成签到 ,获得积分10
38秒前
无花果应助科研通管家采纳,获得10
38秒前
ywzwszl完成签到,获得积分0
41秒前
美丽语蝶发布了新的文献求助10
44秒前
王不凡完成签到,获得积分10
44秒前
老福贵儿完成签到,获得积分0
45秒前
想吃糖葫芦完成签到 ,获得积分10
47秒前
月涵完成签到 ,获得积分10
47秒前
闫栋完成签到 ,获得积分10
48秒前
dayday完成签到,获得积分10
49秒前
我不会乱起名字的完成签到,获得积分10
51秒前
苑世朝完成签到,获得积分10
53秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Lewis’s Child and Adolescent Psychiatry: A Comprehensive Textbook Sixth Edition 2000
Engineering for calcareous sediments : proceedings of the International Conference on Calcareous Sediments, Perth 15-18 March 1988 / edited by R.J. Jewell, D.C. Andrews 1000
Wolffs Headache and Other Head Pain 9th Edition 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 510
Austrian Economics: An Introduction 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6229960
求助须知:如何正确求助?哪些是违规求助? 8054629
关于积分的说明 16795621
捐赠科研通 5311681
什么是DOI,文献DOI怎么找? 2829194
邀请新用户注册赠送积分活动 1807013
关于科研通互助平台的介绍 1665427