
Interpretable online network dictionary learning for inferring long-range chromatin interactions

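Interpretability · Computer science · Cluster analysis · Theoretical computer science · Scalability · Artificial intelligence · Subnetwork · Machine learning · Data mining · Computer security · Database
Authors
Vishal Rana, Jianhao Peng, Chao Pan, Hanbaek Lyu, Albert W. Cheng, Minji Kim, Olgica Milenković
Source
Journal: PLOS Computational Biology [Public Library of Science]
Volume (issue): 20 (5): e1012095
Identifier
DOI: 10.1371/journal.pcbi.1012095
Abstract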

Dictionary learning (DL), implemented via matrix factorization (MF), is commonly used in computational biology to tackle ubiquitous clustering problems. The method is favored due to its conceptual simplicity and relatively low computational complexity. However, DL algorithms produce results that lack interpretability in terms of real biological data. Additionally, they are not optimized for graph-structured data and hence often fail to handle them in a scalable manner. In order to address these limitations, we propose a novel DL algorithm called online convex network dictionary learning (online cvxNDL). Unlike classical DL algorithms, online cvxNDL is implemented via MF and designed to handle extremely large datasets by virtue of its online nature. Importantly, it enables the interpretation of dictionary elements, which serve as cluster representatives, through convex combinations of real measurements. Moreover, the algorithm can be applied to data with a network structure by incorporating specialized subnetwork sampling techniques. To demonstrate the utility of our approach, we apply cvxNDL on 3D-genome RNAPII ChIA-Drop data with the goal of identifying important long-range interaction patterns (long-range dictionary elements). ChIA-Drop probes higher-order interactions, and produces data in the form of hypergraphs whose nodes represent genomic fragments. The hyperedges represent observed physical contacts. Our hypergraph model analysis has the objective of creating an interpretable dictionary of long-range interaction patterns that accurately represent global chromatin physical contact maps. Through the use of dictionary information, one can also associate the contact maps with RNA transcripts and infer cellular functions. To accomplish the task at hand, we focus on RNAPII-enriched ChIA-Drop data from Drosophila melanogaster S2 cell lines. Our results offer two key insights. First, we demonstrate that online cvxNDL retains the accuracy of classical DL (MF) methods while simultaneously ensuring unique interpretability and scalability. Second, we identify distinct collections of proximal and distal interaction patterns involving chromatin elements shared by related processes across different chromosomes, as well as patterns unique to specific chromosomes. To associate the dictionary elements with biological properties of the corresponding chromatin regions, we employ Gene Ontology (GO) enrichment analysis and perform multiple RNA coexpression studies.
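
The abstract's central idea, dictionary elements that are convex combinations of real measurements, can be illustrated with a small sketch. The code below is not the authors' online cvxNDL: it is a batch toy that skips the online updates and subnetwork sampling entirely. The function names (`project_to_simplex`, `convex_dictionary`), the simplex constraint on the codes `H`, and the step-size rule are all assumptions made for illustration.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of a 1-D vector onto the probability simplex."""
    u = np.sort(v)[::-1]                      # sort in descending order
    css = np.cumsum(u) - 1.0
    idx = np.arange(1, v.size + 1)
    rho = np.nonzero(u - css / idx > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)

def convex_dictionary(X, k, steps=200, seed=0):
    """Toy convex matrix factorization X ~ (X @ W) @ H.

    X: (d, n) data matrix, one real measurement per column.
    W: (n, k), each column on the simplex, so every dictionary element
       D[:, j] = X @ W[:, j] is a convex combination of data columns.
    H: (k, n) codes, also kept on the simplex (an assumption of this sketch).
    Returns W, H and the loss history.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    W = np.apply_along_axis(project_to_simplex, 0, rng.random((n, k)))
    H = np.apply_along_axis(project_to_simplex, 0, rng.random((k, n)))
    eps = 1e-12
    losses = [0.5 * np.linalg.norm(X @ W @ H - X) ** 2]
    for _ in range(steps):
        # Projected gradient step on the codes H, step size 1/L.
        D = X @ W
        step_h = 1.0 / (np.linalg.norm(D, 2) ** 2 + eps)
        H = np.apply_along_axis(
            project_to_simplex, 0, H - step_h * (D.T @ (D @ H - X)))
        # Projected gradient step on the mixing weights W, step size 1/L.
        step_w = 1.0 / (np.linalg.norm(X, 2) ** 2
                        * np.linalg.norm(H, 2) ** 2 + eps)
        R = X @ W @ H - X
        W = np.apply_along_axis(
            project_to_simplex, 0, W - step_w * (X.T @ R @ H.T))
        losses.append(0.5 * np.linalg.norm(X @ W @ H - X) ** 2)
    return W, H, losses
```

Because each column of `W` is non-negative and sums to one, every column of `X @ W` lies in the convex hull of the observed data columns, which is what makes a dictionary element directly readable in terms of the real samples that contribute to it.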