计算机科学
边距(机器学习)
人工智能
水准点(测量)
基本事实
残余物
深度学习
异常检测
模式识别(心理学)
图像(数学)
计算机视觉
图像质量
噪音(视频)
机器学习
算法
大地测量学
地理
作者
Tong Chen,Bin Li,Jinhua Zeng
标识
DOI:10.1109/lsp.2023.3245947
摘要
Most of existing deep learning models for image forgery localization rely on a large number of high-quality labeled samples for training. The training procedures are performed off-line and without adaptation to the image under scrutiny. In this letter, we propose to perform run-time learning of the forgery traces from the suspicious image itself. To this aim, a Variational Auto-Encoder (VAE) model is learned to reconstruct small cliques of the suspicious image, and those cliques with anomalous larger reconstruction errors are therein identified as forged. To further enhance performance, Vision Transform (ViT) is employed as the VAE encoder, and multi-modal input information is explored by considering noise inconsistency, high-pass residual inconsistency, and edge discontinuity. Evaluation on widely used benchmark datasets shows our method outperforms existing blind methods by a large margin, and is competitive against approaches that use ground-truth for supervised training.
科研通智能强力驱动
Strongly Powered by AbleSci AI