管道(软件)
CTCF公司
染色质
生物
预处理器
数据科学
计算机科学
计算生物学
基因组学
基因组
生物信息学
人工智能
数据挖掘
机器学习
遗传学
基因
基因表达
增强子
程序设计语言
作者
Anup Kumar Halder,Abhishek Agarwal,Karolina Jodkowska,Dariusz Plewczyński
出处
期刊:Briefings in Functional Genomics
[Oxford University Press]
日期:2024-03-30
被引量:1
摘要
Abstract Genomic data analysis has witnessed a surge in complexity and volume, primarily driven by the advent of high-throughput technologies. In particular, studying chromatin loops and structures has become pivotal in understanding gene regulation and genome organization. This systematic investigation explores the realm of specialized bioinformatics pipelines designed specifically for the analysis of chromatin loops and structures. Our investigation incorporates two protein (CTCF and Cohesin) factor-specific loop interaction datasets from six distinct pipelines, amassing a comprehensive collection of 36 diverse datasets. Through a meticulous review of existing literature, we offer a holistic perspective on the methodologies, tools and algorithms underpinning the analysis of this multifaceted genomic feature. We illuminate the vast array of approaches deployed, encompassing pivotal aspects such as data preparation pipeline, preprocessing, statistical features and modelling techniques. Beyond this, we rigorously assess the strengths and limitations inherent in these bioinformatics pipelines, shedding light on the interplay between data quality and the performance of deep learning models, ultimately advancing our comprehension of genomic intricacies.
科研通智能强力驱动
Strongly Powered by AbleSci AI