计算机科学
人工智能
纹理(宇宙学)
图像(数学)
比例(比率)
深度学习
计算机视觉
网(多面体)
模式识别(心理学)
机器学习
数学
地图学
几何学
地理
作者
Qiang Xu,Shan Jia,Xinghao Jiang,Tanfeng Sun,Zhe Wang,Hong Yan
标识
DOI:10.1016/j.eswa.2024.123368
摘要
Distinguishing between computer-generated (CG) and natural photographic (PG) images is of great importance to verify the authenticity and originality of digital images. However, the recent cutting-edge generation methods enable high qualities of synthesis in CG images, which makes this challenging task even trickier. To address this issue, a novel multi-scale deep texture learning neural network coined as MDTL-NET is proposed for CG image detection. We first utilize a global texture representation module incorporating the ResNet architecture to capture multi-scale texture patterns. Then, a deep texture enhancement module based on a semantic segmentation map guided affine transformation operation is designed for texture difference amplification. To enhance performance, we equip the MDTL-NET with channel and spatial attention mechanisms, which refines intermediate features and facilitates trace exploration in different domains. Moreover, a Low-rank Tensor Representation (LTR) strategy is also used for feature fusion. Extensive experiments on three public datasets and a newly constructed dataset1 with more realistic and diverse images show that the proposed approach outperforms existing methods in the field by a clear margin. Besides, results also demonstrate the detection robustness and generalization ability of the proposed approach to postprocessing operations.
科研通智能强力驱动
Strongly Powered by AbleSci AI