计算机科学
树形结构
树(集合论)
嵌入
理论计算机科学
图形
人工智能
等级制度
先验与后验
分层数据库模型
数据挖掘
数据结构
机器学习
数学
程序设计语言
数学分析
哲学
认识论
经济
市场经济
作者
Deji Zhao,Bo Niu,Shuangyong Song,Chao Wang,Xiangyan Chen,Xiang Yu,Bo Zou
标识
DOI:10.1007/978-3-031-25198-6_3
摘要
Hierarchical text classification (HTC) is a challenging task that classifies textual descriptions with a taxonomic hierarchy. Existing methods have difficulties in modeling the hierarchical label structure. They focus on using the graph embedding methods to encode the hierarchical structure, ignoring that the HTC labels are based on a tree structure. There is a difference between tree and graph structure: in the graph structure, message passing is undirected, which will lead to the imbalance of message transmission between nodes when applied to HTC. As the nodes in different layers have inheritance relationships, the information transmission between nodes should be directional and hierarchical in the HTC task. In this paper, we propose a Top-Down Tree Structure Awareness Model to extract the hierarchical structure features, called ToSA. We regard HTC as a sequence generation task and introduce a priori hierarchical information in the decoding process. We block the information flow in one direction to ensure the graph embedding method is more suitable for the HTC task, then get the enhanced tree structure representation. Experiment results show that our model can achieve the best results on both the public WOS dataset and a collected E-commerce user intent classification dataset $$^3$$ .
科研通智能强力驱动
Strongly Powered by AbleSci AI