化学信息学
相似性(几何)
指纹(计算)
计算机科学
最近邻搜索
数据挖掘
化学数据库
秩(图论)
人工智能
虚拟筛选
分子描述符
价值(数学)
数量结构-活动关系
模式识别(心理学)
机器学习
数学
生物信息学
药物发现
图像(数学)
组合数学
生物
作者
Martin Vogt,Jürgen Bajorath
标识
DOI:10.1002/minf.201600131
摘要
Similarity searching using molecular fingerprints has a long history in chemoinformatics and continues to be a popular approach for virtual screening. Typically, known active reference molecules are used to search databases for new active compounds. However, this search has black box character because similarity value distributions are dependent on fingerprints and compound classes. Consequently, no generally applicable similarity threshold values are available as reliable indicators of activity relationships between reference and database compounds. Therefore, it is generally uncertain where new active compounds might appear in database rankings, if at all. In this contribution, methods are discussed for modeling similarity value distributions of fingerprint search calculations using Tanimoto coefficients and estimating rank positions of active compounds. To our knowledge, these are the first approaches for predicting the results of fingerprint-based similarity searching.
科研通智能强力驱动
Strongly Powered by AbleSci AI