计算机科学
人工智能
机器学习
模棱两可
水准点(测量)
多标签分类
矩阵分解
特征(语言学)
数据挖掘
语言学
特征向量
物理
哲学
大地测量学
量子力学
程序设计语言
地理
作者
Ke Wang,Ning Xu,Miaogen Ling,Xin Geng
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2021-01-01
卷期号:: 1-1
被引量:10
标识
DOI:10.1109/tkde.2021.3092406
摘要
Label Distribution Learning (LDL) has attracted increasing research attentions due to its potential to address the label ambiguity problem in machine learning and success in many real-world applications. In LDL, it is usually expensive to obtain the ground-truth label distributions of data, but it is relatively easy to obtain the logical labels of data. How to use training instances only with logical labels to learn an effective LDL model is a challenging problem. In this paper, we propose a two-step framework to address this problem. Specifically, we firstly design an efficient recovery model to recover the latent label distributions of training instances, named Fast Label Enhancement (FLE). Our idea is to use non-negative matrix factorization (NMF) to mine the label distribution information from the feature space. Moreover, we take the instance-class similarities into consideration to discover the importance of each label to training instances, which is useful for learning precise label distributions. Then, we train a predictive model for testing instances based on generated label distributions of training instances and an existing LDL method (e.g., SA-BFGS). Experimental results on fifteen benchmark datasets show the effectiveness of the proposed two-step framework and verify the superiority of FLE over several state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI