计算机科学
杠杆(统计)
多媒体信息检索
语义学(计算机科学)
利用
语义特征
特征(语言学)
人工智能
分类器(UML)
多媒体
机器学习
数据挖掘
语言学
哲学
计算机安全
程序设计语言
作者
Yi Yang,Jingkuan Song,Zi Huang,Zhigang Ma,Nicu Sebe,Alexander G. Hauptmann
标识
DOI:10.1109/tmm.2012.2234731
摘要
Multimedia data are usually represented by multiple features. In this paper, we propose a new algorithm, namely Multi-feature Learning via Hierarchical Regression for multimedia semantics understanding, where two issues are considered. First, labeling large amount of training data is labor-intensive. It is meaningful to effectively leverage unlabeled data to facilitate multimedia semantics understanding. Second, given that multimedia data can be represented by multiple features, it is advantageous to develop an algorithm which combines evidence obtained from different features to infer reliable multimedia semantic concept classifiers. We design a hierarchical regression model to exploit the information derived from each type of feature, which is then collaboratively fused to obtain a multimedia semantic concept classifier. Both label information and data distribution of different features representing multimedia data are considered. The algorithm can be applied to a wide range of multimedia applications and experiments are conducted on video data for video concept annotation and action recognition. Using Trecvid and CareMedia video datasets, the experimental results show that it is beneficial to combine multiple features. The performance of the proposed algorithm is remarkable when only a small amount of labeled training data are available.
科研通智能强力驱动
Strongly Powered by AbleSci AI