计算机科学
支持向量机
边距(机器学习)
等级制度
分类
任务(项目管理)
文本分类
人工智能
机器学习
班级(哲学)
功能(生物学)
类层次结构
判别式
数据挖掘
工程类
进化生物学
生物
面向对象程序设计
经济
程序设计语言
系统工程
市场经济
作者
Lijuan Cai,Thomas Hofmann
标识
DOI:10.1145/1031171.1031186
摘要
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques like Support Vector Machines and related large margin methods have been successfully applied for this task, albeit the fact that they ignore the inter-class relationships. In this paper, we propose a novel hierarchical classification method that generalizes Support Vector Machine learning and that is based on discriminant functions that are structured in a way that mirrors the class hierarchy. Our method can work with arbitrary, not necessarily singly connected taxonomies and can deal with task-specific loss functions. All parameters are learned jointly by optimizing a common objective function corresponding to a regularized upper bound on the empirical loss. We present experimental results on the WIPO-alpha patent collection to show the competitiveness of our approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI