蛋白质数据库
蛋白质数据库
UniProt公司
计算机科学
蛋白质结构数据库
结构生物信息学
蛋白质结构
表(数据库)
软件
任务(项目管理)
数据挖掘
生物信息学
蛋白质结构预测
计算生物学
序列数据库
程序设计语言
生物
生物化学
管理
经济
基因
作者
Kristine Degn,Ludovica Beltrame,Matteo Tiberti,Elena Papaleo
标识
DOI:10.1021/acs.jcim.3c00884
摘要
Computational methods relying on protein structure strongly depend on the structure selected for investigation. Typical sources of protein structures include experimental structures available at the Protein Data Bank (PDB) and high-quality in silico model structures, such as those available at the AlphaFold Protein Structure Database. Either option has significant advantages and drawbacks, and exploring the wealth of available structures to identify the most suitable ones for specific applications can be a daunting task. We provide an open-source software package, PDBminer, with the purpose of making structure identification and selection easier, faster, and less error prone. PDBminer searches the AlphaFold Database and the PDB for available structures of interest and provides an up-to-date, quality-ranked table of structures applicable for further use. PDBminer provides an overview of the available protein structures to one or more input proteins, parallelizing the runs if multiple cores are specified. The output table reports the coverage of the protein structures aligned to the UniProt sequence, overcoming numbering differences in PDB structures and providing information regarding model quality, protein complexes, ligands, and nucleic acid chain binding. The PDBminer2coverage and PDBminer2network tools assist in visualizing the results. PDBminer can be applied to overcome the tedious task of choosing a PDB structure without losing the wealth of additional information available in the PDB. Here, we showcase the main functionalities of the package on the p53 tumor suppressor protein. The package is available at http://github.com/ELELAB/PDBminer.
科研通智能强力驱动
Strongly Powered by AbleSci AI