缩放空间
平滑的
代表(政治)
比例(比率)
计算机科学
特征(语言学)
核(代数)
计算
空格(标点符号)
公理
财产(哲学)
算法
高斯分布
多样性(控制论)
模式识别(心理学)
人工智能
理论计算机科学
数学
图像处理
计算机视觉
图像(数学)
几何学
纯数学
哲学
物理
操作系统
认识论
法学
政治
量子力学
语言学
政治学
摘要
An inherent property of objects in the world is that they only exist as meaningful entities over certain ranges of scale. If one aims to describe the structure of unknown real-world signals, then a multi-scale representation of data is of crucial importance. This paper gives a tutorial review of a special type of multi-scale representation—linear scale-space representation—which has been developed by the computer vision community to handle image structures at different scales in a consistent manner. The basic idea is to embed the original signal into a one-parameter family of gradually smoothed signals in which the fine-scale details are successively suppressed. Under rather general conditions on the type of computations that are to be performed at the first stages of visual processing, in what can be termed ‘the visual front-end’, it can be shown that the Gaussian kernel and its derivatives are singled out as the only possible smoothing kernels. The conditions that specify the Gaussian kernel are, basically, linearity and shift invariance, combined with different ways of formalizing the notion that structures at coarse scales should correspond to simplifications of corresponding structures at fine scales-they should not be accidental phenomena created by the smoothing method. Notably, several different ways of choosing scale-space axioms give rise to the same conclusion. The output from the scale-space representation can be used for a variety of early visual tasks; operations such as feature detection, feature classification and shape computation can be expressed directly in terms of (possibly non-linear) combinations of Gaussian derivatives at multiple scales. In this sense the scale-space representation canserve as a basis for early vision. During the last few decades, a number of other approaches to multiscale representations have been developed, which are more or less related to scale-space theory, notably the theories of pyramids, wavelets and multi grid methods.Despite their qualitative differences, the increasing propularity of each of these approaches indicates that the crucial notion of scale is increasingly appreciated by the computer.vision community and by researchers in other related fields. An interesting similarity to biological vision is that the scale-space operators closely resemble receptive field profiles registered in neurophysiological studies of the mam- malian retina and visual cortex.
科研通智能强力驱动
Strongly Powered by AbleSci AI