DNA结合位点
转录因子
鉴定(生物学)
人工神经网络
计算机科学
计算生物学
结合位点
数据挖掘
相似性(几何)
基因组
人工智能
机器学习
基因
生物
遗传学
发起人
基因表达
生态学
图像(数学)
作者
Wenkai Yan,Zutan Li,Cong Pian,Yufeng Wu
摘要
Abstract Identification of transcription factor binding sites (TFBSs) is essential to understanding of gene regulation. Designing computational models for accurate prediction of TFBSs is crucial because it is not feasible to experimentally assay all transcription factors (TFs) in all sequenced eukaryotic genomes. Although many methods have been proposed for the identification of TFBSs in humans, methods designed for plants are comparatively underdeveloped. Here, we present PlantBind, a method for integrated prediction and interpretation of TFBSs based on DNA sequences and DNA shape profiles. Built on an attention-based multi-label deep learning framework, PlantBind not only simultaneously predicts the potential binding sites of 315 TFs, but also identifies the motifs bound by transcription factors. During the training process, this model revealed a strong similarity among TF family members with respect to target binding sequences. Trans-species prediction performance using four Zea mays TFs demonstrated the suitability of this model for transfer learning. Overall, this study provides an effective solution for identifying plant TFBSs, which will promote greater understanding of transcriptional regulatory mechanisms in plants.
科研通智能强力驱动
Strongly Powered by AbleSci AI