相似性(几何)
计算机科学
化学数据库
化学相似性
化学图书馆
代表(政治)
药物发现
空格(标点符号)
化学信息学
多样性(政治)
情报检索
结构相似性
数据科学
化学空间
化学
人工智能
生物
生物信息学
小分子
操作系统
法学
社会学
图像(数学)
政治学
政治
生物化学
人类学
作者
Timothy B. Dunn,Gustavo Seabra,Taewon David Kim,K. Eurídice Juárez-Mercado,Chenglong Li,José L. Medina‐Franco,Ramón Alain Miranda‐Quintana
标识
DOI:10.1021/acs.jcim.1c01013
摘要
The quantification of chemical diversity has many applications in drug discovery, organic chemistry, food, and natural product chemistry, to name a few. As the size of the chemical space is expanding rapidly, it is imperative to develop efficient methods to quantify the diversity of large and ultralarge chemical libraries and visualize their mutual relationships in chemical space. Herein, we show an application of our recently introduced extended similarity indices to measure the fingerprint-based diversity of 19 chemical libraries typically used in drug discovery and natural products research with over 18 million compounds. Based on this concept, we introduce the Chemical Library Networks (CLNs) as a general and efficient framework to represent visually the chemical space of large chemical libraries providing a global perspective of the relation between the libraries. For the 19 compound libraries explored in this work, it was found that the (extended) Tanimoto index offers the best description of extended similarity in combination with RDKit fingerprints. CLNs are general and can be explored with any structure representation and similarity coefficient for large chemical libraries.
科研通智能强力驱动
Strongly Powered by AbleSci AI