Adversarial system
Deep learning
Artificial neural network
Embedding
Convolutional neural network
Authors
Debang Li,Huikai Wu,Junge Zhang,Kaiqi Huang
Source
Journal: IEEE Transactions on Image Processing
Date: 2019-05-07
Volume (issue): 28 (10): 5105-5120
Citations: 14
Identifier
DOI:10.1109/tip.2019.2914360
Abstract
Image cropping aims to improve image quality by removing unwanted outer areas and is widely used in the photography and printing industries. Most previous cropping methods that do not need bounding box supervision rely on a sliding window mechanism, which fixes the aspect ratios and limits the shape of the cropping region. Moreover, the sliding window method usually produces many candidates on the input image, which is very time-consuming. Motivated by these challenges, we formulate image cropping as a sequential decision-making process and propose a reinforcement learning-based framework to address this problem, namely Fast Aesthetics-Aware Adversarial Reinforcement Learning (Fast A3RL). In particular, the proposed method develops an aesthetics-aware reward function dedicated to image cropping. Similar to the human decision-making process, we use a comprehensive state representation that includes both the current observation and the historical experience. We train the agent using the actor-critic architecture in an end-to-end manner. An adversarial learning process is also applied during the training stage. The proposed method is evaluated on several popular cropping datasets whose images are unseen during training. The experimental results show that our method achieves state-of-the-art performance with far fewer candidate windows and much less time than related methods.
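To make the "sequential decision-making" framing in the abstract concrete, the following is a minimal sketch of cropping as a sequence of window adjustments. Everything specific in it is an assumption for illustration and is not taken from the paper: the action names, the step size `STEP`, the minimum window size `MIN_SIZE`, and the hand-written `scripted_policy`, which merely stands in for a learned actor. The aesthetics-aware reward, the actor-critic training, and the adversarial component are not modelled here.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

STEP = 0.05      # assumed: each action moves one edge by 5% of the image size
MIN_SIZE = 0.2   # assumed: the window may not shrink below 20% per side


@dataclass(frozen=True)
class Crop:
    """Crop window in normalized image coordinates (0..1)."""
    x0: float
    y0: float
    x1: float
    y1: float


# Hypothetical action set: each entry is (dx0, dy0, dx1, dy1).
ACTIONS = {
    "shrink_left": (STEP, 0.0, 0.0, 0.0),
    "shrink_top": (0.0, STEP, 0.0, 0.0),
    "shrink_right": (0.0, 0.0, -STEP, 0.0),
    "shrink_bottom": (0.0, 0.0, 0.0, -STEP),
}


def apply_action(crop: Crop, name: str) -> Crop:
    """Move one edge inward; keep the old window if the move would make it too small."""
    dx0, dy0, dx1, dy1 = ACTIONS[name]
    moved = Crop(crop.x0 + dx0, crop.y0 + dy0, crop.x1 + dx1, crop.y1 + dy1)
    too_small = (moved.x1 - moved.x0 < MIN_SIZE) or (moved.y1 - moved.y0 < MIN_SIZE)
    return crop if too_small else moved


Policy = Callable[[Crop, Tuple[str, ...]], str]


def run_episode(policy: Policy, max_steps: int = 20) -> Tuple[Crop, List[str]]:
    """Start from the full image and adjust the window until the policy says 'stop'."""
    crop = Crop(0.0, 0.0, 1.0, 1.0)
    history: List[str] = []
    for _ in range(max_steps):
        # The policy sees the current window and the past actions,
        # loosely mirroring "current observation plus historical experience".
        action = policy(crop, tuple(history))
        if action == "stop":
            break
        crop = apply_action(crop, action)
        history.append(action)
    return crop, history


def scripted_policy(crop: Crop, history: Tuple[str, ...]) -> str:
    """Placeholder for a learned actor: follows a fixed plan, then stops."""
    plan = ["shrink_left", "shrink_left", "shrink_top"]
    return plan[len(history)] if len(history) < len(plan) else "stop"
```

Calling `run_episode(scripted_policy)` evaluates only one window per step, and because each edge moves independently the final window is not tied to a fixed aspect ratio. These are the two properties the abstract contrasts with sliding-window candidates.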