计算机科学
人工智能
多标签分类
集合(抽象数据类型)
模式识别(心理学)
机器学习
k-最近邻算法
分类
班级(哲学)
先验与后验
任务(项目管理)
基于实例的学习
半监督学习
哲学
管理
认识论
经济
程序设计语言
作者
Min-Ling Zhang,Zhi‐Hua Zhou
标识
DOI:10.1016/j.patcog.2006.12.019
摘要
Multi-label learning originated from the investigation of text categorization problem, where each document may belong to several predefined topics simultaneously. In multi-label learning, the training set is composed of instances each associated with a set of labels, and the task is to predict the label sets of unseen instances through analyzing training instances with known label sets. In this paper, a multi-label lazy learning approach named ML-KNN is presented, which is derived from the traditional K-nearest neighbor (KNN) algorithm. In detail, for each unseen instance, its K nearest neighbors in the training set are firstly identified. After that, based on statistical information gained from the label sets of these neighboring instances, i.e. the number of neighboring instances belonging to each possible class, maximum a posteriori (MAP) principle is utilized to determine the label set for the unseen instance. Experiments on three different real-world multi-label learning problems, i.e. Yeast gene functional analysis, natural scene classification and automatic web page categorization, show that ML-KNN achieves superior performance to some well-established multi-label learning algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI