计算机科学
自编码
变压器
分子图
新颖性
平滑的
生成语法
人工智能
图形
机器学习
理论计算机科学
人工神经网络
工程类
电气工程
哲学
神学
计算机视觉
电压
作者
Trieu Nguyen,Aleksandra Karolak
标识
DOI:10.1101/2024.07.22.604603
摘要
ABSTRACT In the field of drug discovery, the generation of new molecules with desirable properties remains a critical challenge. Traditional methods often rely on SMILES (Simplified Molecular Input Line Entry System) representations for molecular input data, which can limit the diversity and novelty of generated molecules. To address this, we present the Transformer Graph Variational Autoencoder (TGVAE), an innovative AI model that employs molecular graphs as input data, thus captures the complex structural relationships within molecules more effectively than string models. To enhance molecular generation capabilities, TGVAE combines a Transformer, Graph Neural Network (GNN), and Variational Autoencoder (VAE). Additionally, we address common issues like over-smoothing in training GNNs and posterior collapse in VAE to ensure robust training and improve the generation of chemically valid and diverse molecular structures. Our results demonstrate that TGVAE outperforms existing approaches, generating a larger collection of diverse molecules and discovering structures that were previously unexplored. This advancement not only brings more possibilities for drug discovery but also sets a new level for the use of AI in molecular generation.
科研通智能强力驱动
Strongly Powered by AbleSci AI