Interpretable online network dictionary learning for inferring long-range chromatin interactions

可解释性 计算机科学 聚类分析 理论计算机科学 可扩展性 人工智能 子网 机器学习 数据挖掘 计算机安全 数据库
作者
Vishal Rana,Jianhao Peng,Chao Pan,Hanbaek Lyu,Albert W. Cheng,Minji Kim,Olgica Milenković
出处
期刊:PLOS Computational Biology [Public Library of Science]
卷期号:20 (5): e1012095-e1012095
标识
DOI:10.1371/journal.pcbi.1012095
摘要

Dictionary learning (DL), implemented via matrix factorization (MF), is commonly used in computational biology to tackle ubiquitous clustering problems. The method is favored due to its conceptual simplicity and relatively low computational complexity. However, DL algorithms produce results that lack interpretability in terms of real biological data. Additionally, they are not optimized for graph-structured data and hence often fail to handle them in a scalable manner. In order to address these limitations, we propose a novel DL algorithm called online convex network dictionary learning (online cvxNDL). Unlike classical DL algorithms, online cvxNDL is implemented via MF and designed to handle extremely large datasets by virtue of its online nature. Importantly, it enables the interpretation of dictionary elements, which serve as cluster representatives, through convex combinations of real measurements. Moreover, the algorithm can be applied to data with a network structure by incorporating specialized subnetwork sampling techniques. To demonstrate the utility of our approach, we apply cvxNDL on 3D-genome RNAPII ChIA-Drop data with the goal of identifying important long-range interaction patterns (long-range dictionary elements). ChIA-Drop probes higher-order interactions, and produces data in the form of hypergraphs whose nodes represent genomic fragments. The hyperedges represent observed physical contacts. Our hypergraph model analysis has the objective of creating an interpretable dictionary of long-range interaction patterns that accurately represent global chromatin physical contact maps. Through the use of dictionary information, one can also associate the contact maps with RNA transcripts and infer cellular functions. To accomplish the task at hand, we focus on RNAPII-enriched ChIA-Drop data from Drosophila Melanogaster S2 cell lines. Our results offer two key insights. First, we demonstrate that online cvxNDL retains the accuracy of classical DL (MF) methods while simultaneously ensuring unique interpretability and scalability. Second, we identify distinct collections of proximal and distal interaction patterns involving chromatin elements shared by related processes across different chromosomes, as well as patterns unique to specific chromosomes. To associate the dictionary elements with biological properties of the corresponding chromatin regions, we employ Gene Ontology (GO) enrichment analysis and perform multiple RNA coexpression studies.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
josh完成签到,获得积分10
刚刚
龙龙猫发布了新的文献求助10
刚刚
xiao完成签到,获得积分20
1秒前
1秒前
1秒前
沉静的听寒完成签到,获得积分20
1秒前
2秒前
FashionBoy应助专注雁采纳,获得10
3秒前
3秒前
Owen应助专注雁采纳,获得10
3秒前
爆米花应助专注雁采纳,获得50
3秒前
赘婿应助专注雁采纳,获得10
3秒前
NexusExplorer应助专注雁采纳,获得10
3秒前
CipherSage应助专注雁采纳,获得10
3秒前
传奇3应助专注雁采纳,获得30
3秒前
科研通AI6.3应助专注雁采纳,获得10
3秒前
3秒前
乔乔发布了新的文献求助20
4秒前
4466完成签到,获得积分10
4秒前
神奇的种子完成签到,获得积分10
4秒前
5秒前
田様应助Inten采纳,获得10
5秒前
5秒前
方梦昕完成签到,获得积分10
6秒前
racill发布了新的文献求助10
6秒前
6秒前
xiw完成签到,获得积分10
7秒前
格格发布了新的文献求助10
8秒前
靓丽枫叶完成签到 ,获得积分10
8秒前
羡鱼发布了新的文献求助10
8秒前
8秒前
落寞剑成完成签到 ,获得积分10
9秒前
星弟发布了新的文献求助10
9秒前
9秒前
赘婿应助asdf采纳,获得10
10秒前
10秒前
ghdvg发布了新的文献求助10
11秒前
田様应助翔君采纳,获得10
12秒前
科研通AI2S应助翔君采纳,获得10
12秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 3000
Polymorphism and polytypism in crystals 1000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
T/SNFSOC 0002—2025 独居石精矿碱法冶炼工艺技术标准 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6044738
求助须知:如何正确求助?哪些是违规求助? 7813092
关于积分的说明 16246129
捐赠科研通 5190459
什么是DOI,文献DOI怎么找? 2777385
邀请新用户注册赠送积分活动 1760617
关于科研通互助平台的介绍 1643767