可解释性
计算机科学
财产(哲学)
代表(政治)
药物发现
人工智能
特征(语言学)
机器学习
一般化
特征学习
分子描述符
数据挖掘
数量结构-活动关系
化学
数学
数学分析
哲学
生物化学
语言学
认识论
政治
政治学
法学
作者
Jinzhou Wu,Su Yang,Aixi Yang,Jingzheng Ren,Yi Xiang
标识
DOI:10.1016/j.compbiomed.2023.107452
摘要
Accurate characterization of molecular representations plays an important role in the property prediction based on deep learning (DL) for drug discovery. However, most previous researches considered only one type of molecular representations, resulting in that it difficult to capture the full molecular feature information. In this study, a novel DL framework called multi-modal molecular representation learning fusion network (MMRLFN) is developed, which could simultaneously learn and integrate drug molecular features from molecular graphs and SMILES sequences. The developed MMRLFN method is composed of three complementary deep neural networks to learn various features from different molecular representations, such as molecular topology, local chemical background information, and substructures at varying scales. Eight public datasets involving various molecular properties used in drug discovery were employed to train and evaluate the developed MMRLFN. The obtained models showed better performances than the existing models based on mono-modal molecular representations. Additionally, a thorough analysis of the noise resistance and interpretability of the MMRLFN has been carried out. The generalization ability and effectiveness of the MMRLFN has been verified by case studies as well. Overall, the MMRLFN can accurately predict molecular properties and provide potentially valuable information from large datasets, thereby maximizing the possibility of successful drug discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI