表观遗传学
表观基因组
循环神经网络
自编码
代表(政治)
计算机科学
人工智能
表观遗传学
计算生物学
深度学习
人工神经网络
基因组学
生物
基因组
DNA甲基化
遗传学
基因
基因表达
政治
法学
政治学
作者
Kevin B. Dsouza,Adam Li,V.K. Bhargava,Maxwell W. Libbrecht
标识
DOI:10.1109/tcbb.2021.3084147
摘要
The availability of thousands of assays of epigenetic activity necessitates compressed representations of these data sets that summarize the epigenetic landscape of the genome. Until recently, most such representations were cell type-specific, applying to a single tissue or cell state. Recently, neural networks have made it possible to summarize data across tissues to produce a pan-cell type representation. In this work, we propose Epi-LSTM, a deep long short-term memory (LSTM) recurrent neural network autoencoder to capture the long-term dependencies in the epigenomic data. The latent representations from Epi-LSTM capture a variety of genomic phenomena, including gene-expression, promoter-enhancer interactions, replication timing, frequently interacting regions, and evolutionary conservation. These representations outperform existing methods in a majority of cell types while yielding smoother representations along the genomic axis due to their sequential nature.
科研通智能强力驱动
Strongly Powered by AbleSci AI