金属蛋白
连接(拓扑)
同源(生物学)
计算机科学
计算生物学
配体(生物化学)
信号转导衔接蛋白
化学
人工智能
生物
生物化学
受体
基因
数学
酶
组合数学
作者
Y Z Wang,Xiang Liu,Yipeng Zhang,Xiangjun Wang,Kelin Xia
标识
DOI:10.1021/acs.jcim.4c02309
摘要
With the crucial role of metalloproteins in respiration, oxidative stress protection, photosynthesis, and drug metabolism, the design and discovery of drugs that can target metalloproteins are extremely important. Recently, enormous potential has been shown by topological data analysis (TDA) and TDA-based machine learning models in various steps of drug design and discovery. Here, we propose, for the first time, join persistent homology (JPH) and JPH-based machine learning models for metalloprotein-ligand binding affinity prediction. Mathematically, dramatically different from persistent homology and extended persistent homology, our JPH employs a set of filtration functions to generate a multistage filtration for the join of the original simplicial complex and a specially designed test simplicial complex. From the featurization perspective, our JPH-based molecular descriptors can provide a more comprehensive characterization of the intrinsic topological information of the data. Our JPH descriptors are combined with the gradient boosting tree (GBT) model for metalloprotein-ligand binding affinity prediction. The benchmark dataset for metalloprotein-ligand complexes from PDBbind-v2020 is employed for the validation and comparison of our model. It has been found that our JPH-GBT model can outperform all of the existing models, as far as we know. This demonstrates the great potential of our join persistent homology in the characterization of molecular structures and functions.
科研通智能强力驱动
Strongly Powered by AbleSci AI