panHiTE: a comprehensive and accurate pipeline for TE detection in large-scale population genomes

管道(软件) 比例(比率) 基因组 人口 计算生物学 计算机科学 数据科学 地理 生物 地图学 遗传学 医学 环境卫生 基因 程序设计语言
作者
Kang Hu,Minghua Xu,Jianxin Wang
标识
DOI:10.1101/2025.02.15.638472
摘要

Transposable elements (TEs) are key drivers of genomic variation and species evolution. Advances in high-throughput sequencing have enabled whole-genome sequencing of individuals or subspecies, facilitating the identification of population-specific variations. Detecting population-specific TE insertions at scale is crucial for understanding species-specific phenotypic traits. However, tools for constructing comprehensive pan-TE databases remain limited. To address this gap, we develop panHiTE, a population-scale TE detection and annotation tool with several core innovations. panHiTE features a deep learning-based long terminal repeat retrotransposon (LTR-RT) detection algorithm, outperforming existing tools in both sensitivity and precision. It also introduces a novel de-redundancy algorithm, which eliminates highly divergent redundant TE instances, significantly reducing the size of the TE library. Additionally, panHiTE can detect low-copy TEs, which are overlooked in individual genome analyses and absent from existing databases due to their rarity. Furthermore, panHiTE allows for TE-gene association analysis, enabling comprehensive insights into TE-driven phenotypic variation. panHiTE, powered by a Nextflow pipeline, enables efficient and scalable TE detection in large plant genomes and has successfully been applied to hundreds of plant population genomes, demonstrating its effectiveness and scalability.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
李健应助伶俐的高烽采纳,获得10
1秒前
袋袋完成签到,获得积分10
1秒前
1秒前
诉衷情完成签到,获得积分10
1秒前
四木木发布了新的文献求助10
1秒前
思源应助vvv采纳,获得10
1秒前
1秒前
菠萝完成签到,获得积分10
2秒前
外向半青完成签到,获得积分10
2秒前
Vera发布了新的文献求助10
2秒前
filory发布了新的文献求助10
2秒前
2秒前
Tammy完成签到,获得积分10
3秒前
清达发布了新的文献求助10
3秒前
高大厉发布了新的文献求助50
3秒前
4秒前
伶俐的不尤完成签到,获得积分10
4秒前
诉衷情发布了新的文献求助10
5秒前
5秒前
范志辉应助溪鱼采纳,获得20
6秒前
CodeCraft应助林夕采纳,获得10
6秒前
科研通AI2S应助明芬采纳,获得10
6秒前
6秒前
满满完成签到,获得积分10
7秒前
hhh完成签到,获得积分10
7秒前
7秒前
7秒前
zhouzhou发布了新的文献求助10
8秒前
8秒前
8秒前
蛋黄的阿爸完成签到,获得积分10
8秒前
紫薰发布了新的文献求助10
8秒前
8秒前
清秀的绫发布了新的文献求助10
9秒前
9秒前
9秒前
9秒前
yyy发布了新的文献求助10
10秒前
ikun发布了新的文献求助10
10秒前
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Lewis’s Child and Adolescent Psychiatry: A Comprehensive Textbook Sixth Edition 2000
Cronologia da história de Macau 1600
Treatment response-adapted risk index model for survival prediction and adjuvant chemotherapy selection in nonmetastatic nasopharyngeal carcinoma 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
BRITTLE FRACTURE IN WELDED SHIPS 1000
Toughness acceptance criteria for rack materials and weldments in jack-ups 800
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6207141
求助须知:如何正确求助?哪些是违规求助? 8033523
关于积分的说明 16733641
捐赠科研通 5298038
什么是DOI,文献DOI怎么找? 2822823
邀请新用户注册赠送积分活动 1801834
关于科研通互助平台的介绍 1663378