转录因子
计算生物学
生物
染色质
RNA序列
卷积神经网络
基因
基因组
基因调控网络
抄写(语言学)
计算机科学
人工智能
遗传学
基因表达
转录组
语言学
哲学
作者
Jingpeng Liu,Ximiao Shi,Zhitai Zhang,Xuexiang Cen,Lixian Lin,Xiaowei Wang,Zhongxian Chen,Yu Zhang,Xiangzi Zheng,Binghua Wu,Ying Miao
摘要
ABSTRACT Drought is a critical risk factor that impacts rice growth and yields. Previous studies have focused on the regulatory roles of individual transcription factors in response to drought stress. However, there is limited understanding of multi‐factor stresses gene regulatory networks and their mechanisms of action. In this study, we utilised data from the JASPAR database to compile a comprehensive dataset of transcription factors and their binding sites in rice, Arabidopsis, and barley genomes. We employed the PyTorch framework for machine learning to develop a nine‐layer convolutional deep neural network TFBind. Subsequently, we obtained rice RNA‐seq and ATAC‐seq data related to abiotic stress from the public database. Utilising integrative analysis of WGCNA and ATAC‐seq, we effectively identified transcription factors associated with open chromatin regions in response to drought. Interestingly, only 81% of the transcription factors directly bound to the opened genes by testing with TFBind model. By this approach we identified 15 drought‐responsive transcription factors corresponding to open chromatin regions of targets, which enriched in the terms related to protein transport, protein allocation, nitrogen compound transport. This approach provides a valuable tool for predicting TF‐TAG‐opened modules during biological processes.
科研通智能强力驱动
Strongly Powered by AbleSci AI