计算机科学
深度学习
初始化
人工智能
可用性
集合(抽象数据类型)
实施
接口(物质)
人工神经网络
序列(生物学)
编码(集合论)
源代码
机器学习
学习迁移
人机交互
程序设计语言
最大气泡压力法
气泡
生物
并行计算
遗传学
作者
Runyu Jing,Yizhou Li,Xue Li,Fengjuan Liu,Menglong Li,Jiesi Luo
标识
DOI:10.1021/acs.jcim.0c00409
摘要
Deep learning has proven to be a powerful method with applications in various fields including image, language, and biomedical data. Thanks to the libraries and toolkits such as TensorFlow, PyTorch, and Keras, researchers can use different deep learning architectures and data sets for rapid modeling. However, the available implementations of neural networks using these toolkits are usually designed for a specific research and are difficult to transfer to other work. Here, we present autoBioSeqpy, a tool that uses deep learning for biological sequence classification. The advantage of this tool is its simplicity. Users only need to prepare the input data set and then use a command line interface. Then, autoBioSeqpy automatically executes a series of customizable steps including text reading, parameter initialization, sequence encoding, model loading, training, and evaluation. In addition, the tool provides various ready-to-apply and adapt model templates to improve the usability of these networks. We introduce the application of autoBioSeqpy on three biological sequence problems: the prediction of type III secreted proteins, protein subcellular localization, and CRISPR/Cas9 sgRNA activity. autoBioSeqpy is freely available with examples at https://github.com/jingry/autoBioSeqpy.
科研通智能强力驱动
Strongly Powered by AbleSci AI