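Computer science
Object (grammar)
Discriminative model
Artificial intelligence
Scalability
Complement (music)
Object detection
Computer vision
Contextual image classification
Pattern recognition (psychology)
Method
Cognitive neuroscience of visual object recognition
Image (mathematics)
Machine learning
Object-oriented programming
Phenotype
Gene
Chemistry
Complementation
Programming language
Database
Biochemistry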
Authors
Meng Meng,Tianzhu Zhang,Qi Tian,Yongdong Zhang,Feng Wu
Source
Venue: International Conference on Computer Vision
Date: 2021-10-01
Citations: 3
Identifier
DOI:10.1109/iccv48922.2021.00337
Abstract
Weakly supervised object localization (WSOL) aims to localize objects using only image-level labels, which makes it more scalable and practical to deploy than fully supervised methods. However, with only image-level labels, object classification models tend to activate object parts and ignore the whole object, while expanding object parts into the whole object may degrade classification performance. To alleviate this problem, we propose foreground activation maps (FAM), which optimize object localization and classification jointly via an object-aware attention module and a part-aware attention module in a unified model, so that the two tasks complement and enhance each other. To the best of our knowledge, this is the first work to achieve remarkable performance on both tasks by optimizing them jointly via FAM for WSOL. In addition, the two designed modules effectively highlight foreground objects for localization and discover discriminative parts for classification. Extensive experiments with four backbones on two standard benchmarks demonstrate that FAM performs favorably against state-of-the-art WSOL methods.