人工智能
小波
公制(单位)
对比度(视觉)
计算机视觉
人类视觉系统模型
失真(音乐)
像素
计算机科学
数学
视觉掩蔽
忠诚
噪音(视频)
模式识别(心理学)
视觉感受
图像(数学)
感知
运营管理
经济
神经科学
生物
放大器
带宽(计算)
电信
计算机网络
作者
Damon M. Chandler,S.S. Hemami
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2007-08-22
卷期号:16 (9): 2284-2298
被引量:1123
标识
DOI:10.1109/tip.2007.901820
摘要
This paper presents an efficient metric for quantifying the visual fidelity of natural images based on near-threshold and suprathreshold properties of human vision. The proposed metric, the visual signal-to-noise ratio (VSNR), operates via a two-stage approach. In the first stage, contrast thresholds for detection of distortions in the presence of natural images are computed via wavelet-based models of visual masking and visual summation in order to determine whether the distortions in the distorted image are visible. If the distortions are below the threshold of detection, the distorted image is deemed to be of perfect visual fidelity (VSNR = infinity) and no further analysis is required. If the distortions are suprathreshold, a second stage is applied which operates based on the low-level visual property of perceived contrast, and the mid-level visual property of global precedence. These two properties are modeled as Euclidean distances in distortion-contrast space of a multiscale wavelet decomposition, and VSNR is computed based on a simple linear sum of these distances. The proposed VSNR metric is generally competitive with current metrics of visual fidelity; it is efficient both in terms of its low computational complexity and in terms of its low memory requirements; and it operates based on physical luminances and visual angle (rather than on digital pixel values and pixel-based dimensions) to accommodate different viewing conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI