循环神经网络
计算机科学
人工智能
定制
背景(考古学)
人工神经网络
生成语法
肽
序列(生物学)
肽序列
构造(python库)
深度学习
生成模型
计算生物学
机器学习
生物
程序设计语言
生物化学
古生物学
政治学
基因
法学
作者
Alex T. Müller,Jan A. Hiss,Gisbert Schneider
标识
DOI:10.1021/acs.jcim.7b00414
摘要
We present a generative long short-term memory (LSTM) recurrent neural network (RNN) for combinatorial de novo peptide design. RNN models capture patterns in sequential data and generate new data instances from the learned context. Amino acid sequences represent a suitable input for these machine-learning models. Generative models trained on peptide sequences could therefore facilitate the design of bespoke peptide libraries. We trained RNNs with LSTM units on pattern recognition of helical antimicrobial peptides and used the resulting model for de novo sequence generation. Of these sequences, 82% were predicted to be active antimicrobial peptides compared to 65% of randomly sampled sequences with the same amino acid distribution as the training set. The generated sequences also lie closer to the training data than manually designed amphipathic helices. The results of this study showcase the ability of LSTM RNNs to construct new amino acid sequences within the applicability domain of the model and motivate their prospective application to peptide and protein design without the need for the exhaustive enumeration of sequence libraries.
科研通智能强力驱动
Strongly Powered by AbleSci AI