发起人
计算生物学
序列(生物学)
合成生物学
遗传学
调节顺序
生物
DNA
抄写(语言学)
计算机科学
转录因子
基因
基因表达
语言学
哲学
作者
Pengcheng Zhang,Haochen Wang,Hanwen Xu,Lei Wei,Liyang Liu,Zhirui Hu,Xiaowo Wang
标识
DOI:10.1038/s41467-023-41899-y
摘要
Designing promoters with desirable properties is essential in synthetic biology. Human experts are skilled at identifying strong explicit patterns in small samples, while deep learning models excel at detecting implicit weak patterns in large datasets. Biologists have described the sequence patterns of promoters via transcription factor binding sites (TFBSs). However, the flanking sequences of cis-regulatory elements, have long been overlooked and often arbitrarily decided in promoter design. To address this limitation, we introduce DeepSEED, an AI-aided framework that efficiently designs synthetic promoters by combining expert knowledge with deep learning techniques. DeepSEED has demonstrated success in improving the properties of Escherichia coli constitutive, IPTG-inducible, and mammalian cell doxycycline (Dox)-inducible promoters. Furthermore, our results show that DeepSEED captures the implicit features in flanking sequences, such as k-mer frequencies and DNA shape features, which are crucial for determining promoter properties.
科研通智能强力驱动
Strongly Powered by AbleSci AI