代表(政治)
计算机科学
下部结构
特征(语言学)
子网
人工智能
透视图(图形)
模式识别(心理学)
图层(电子)
财产(哲学)
数据挖掘
纳米技术
材料科学
哲学
法学
工程类
认识论
政治
结构工程
语言学
计算机安全
政治学
作者
Tao Ren,Haodong Zhang,Yang Shi,Ximeng Luo,Siqi Zhou
标识
DOI:10.1016/j.jmgm.2022.108401
摘要
Recently, molecular representation and property exploration, with the combination of neural network, play a critical role in the field of drug design and discovery for assisting in drug related research. However, previous research in molecular representation relies heavily on artificial extraction of features based on biological experiments which may result in a manually introduced noise of molecular information with high cost in time and money. In this paper, a novel method named Substructural Hierarchical Attention Network (SuHAN) is proposed to discover inherent characteristics of molecules for representation learning. Specifically, SuHAN is composed of the cascaded layer: atom-level layer and substructure-level layer. Molecule in the SMILES format is divided into several substructural fragments by predefined partition rules, and then they are fed into atom-level layer and substructure-level layer successively to obtain feature representation from different perspective: atomic view and substructural view. In this way, the prominent structural features that may be omitted in global extraction are excavated from a fine-grained viewpoint and fused to reconstruct representative pattern in an overall view. Experiments on biophysics and physiology datasets demonstrate that our model is competitive with a significant improvement of both accuracy and stability in performance. We confirmed that the substructural segments and progressive hierarchical networks lead to an effective molecular representation for downstream tasks. These results provide a novel perspective about reconstructing overall pattern through local prominent structure.
科研通智能强力驱动
Strongly Powered by AbleSci AI