可解释性
核酸
计算机科学
人工智能
变压器
计算生物学
深度学习
染色质
机器学习
DNA
生物
遗传学
工程类
电压
电气工程
作者
Shujun He,Baizhen Gao,Rushant Sabnis,Qing Sun
标识
DOI:10.1021/acssynbio.3c00154
摘要
Much work has been done to apply machine learning and deep learning to genomics tasks, but these applications usually require extensive domain knowledge, and the resulting models provide very limited interpretability. Here, we present the Nucleic Transformer, a conceptually simple but effective and interpretable model architecture that excels in the classification of DNA sequences. The Nucleic Transformer employs self-attention and convolutions on nucleic acid sequences, leveraging two prominent deep learning strategies commonly used in computer vision and natural language analysis. We demonstrate that the Nucleic Transformer can be trained without much domain knowledge to achieve high performance in Escherichia coli promoter classification, viral genome identification, enhancer classification, and chromatin profile predictions.
科研通智能强力驱动
Strongly Powered by AbleSci AI