计算机科学
化学空间
蛋白质数据库
蛋白质配体
对接(动物)
配体(生物化学)
蛋白质结构
虚拟筛选
蛋白质结构预测
药物发现
数据挖掘
人工智能
机器学习
生物系统
化学
生物信息学
生物
医学
生物化学
受体
护理部
有机化学
作者
Xuelian Li,Cheng Shen,Hui Zhu,Yu-Jian Yang,Qing Wang,Jincai Yang,Niu Huang
标识
DOI:10.1021/acs.jcim.3c01170
摘要
High-quality protein-ligand complex structures provide the basis for understanding the nature of noncovalent binding interactions at the atomic level and enable structure-based drug design. However, experimentally determined complex structures are scarce compared with the vast chemical space. In this study, we addressed this issue by constructing the BindingNet data set via comparative complex structure modeling, which contains 69,816 modeled high-quality protein-ligand complex structures with experimental binding affinity data. BindingNet provides valuable insights into investigating protein-ligand interactions, allowing visual inspection and interpretation of structural analogues' structure-activity relationships. It can also be used for evaluating machine-learning-based scoring functions. Our results indicate that machine learning models trained on BindingNet could reduce the bias caused by buried solvent-accessible surface area, as we previously found for models trained on the PDBbind data set. We also discussed strategies to improve BindingNet and its potential utilization for benchmarking the molecular docking methods and ligand binding free energy calculation approaches. The BindingNet complements PDBbind in constructing a sufficient and unbiased protein-ligand binding data set and is freely available at http://bindingnet.huanglab.org.cn.
科研通智能强力驱动
Strongly Powered by AbleSci AI