Journal: Zhongguo kexue [Science in China Press]
Date: 2021-09-01
Volume (issue): 51 (9): 1475-1475
Citations: 167
Identifier
DOI:10.1360/ssi-2020-0370
Abstract
Object segmentation (OS) is a research hotspot in computer vision with a wide range of applications in many fields. Cognitive vision studies have shown that human vision is highly sensitive to both global information and local details in scenes. To this end, we design a novel, efficient, and easy-to-use enhanced-alignment measure ($E_\xi$) for evaluating the performance of OS models. $E_\xi$ combines local pixel values with the image-level mean value and thus jointly evaluates the image-level and pixel-level similarity between a segmentation result and the ground truth (GT). Extensive experiments on four popular benchmarks via five meta-measures, i.e., application ranking, demoting generic, denying noise, human ranking, and recognizing GT, show significant relative improvement over existing widely adopted evaluation metrics such as IoU and $F_\beta$. By combining the weighted binary cross-entropy loss, the enhanced-alignment loss, and the weighted IoU loss, we further design a hybrid loss function (Hybrid-$E_{\rm loss}$) to guide the network to learn pixel-, object-, and image-level features. Qualitative and quantitative results show further accuracy improvements when our hybrid loss function is used in three different OS tasks.
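The abstract does not state the formula for $E_\xi$, so the following is only a minimal sketch of how a measure that "combines local pixel values with the image-level mean value" might be computed. It assumes the commonly cited formulation of the enhanced-alignment measure for binary maps: each pixel is centered by its map's global mean, the two centered maps are compared through a normalized alignment term, and the result is squashed into [0, 1] and averaged. The function name `enhanced_alignment`, the list-of-lists input format, and the `eps` stabilizer are illustrative assumptions, and degenerate cases (an all-foreground or all-background GT) are not treated specially here.

```python
def enhanced_alignment(pred, gt, eps=1e-8):
    """Illustrative enhanced-alignment score between two binary maps.

    pred, gt: equally sized 2-D lists of 0/1 values (assumed input format).
    Returns a float in [0, 1]; higher means better agreement.
    """
    p = [float(v) for row in pred for v in row]
    g = [float(v) for row in gt for v in row]
    if not p or len(p) != len(g):
        raise ValueError("maps must be non-empty and of the same size")

    # Image-level statistic: the global mean of each map.
    mu_p = sum(p) / len(p)
    mu_g = sum(g) / len(g)

    total = 0.0
    for pv, gv in zip(p, g):
        # Bias terms: local pixel value minus the image-level mean.
        bp, bg = pv - mu_p, gv - mu_g
        # Alignment term in [-1, 1]; eps (assumed) avoids division by zero.
        xi = 2.0 * bp * bg / (bp * bp + bg * bg + eps)
        # Enhanced alignment: map [-1, 1] onto [0, 1] with a quadratic.
        total += (1.0 + xi) ** 2 / 4.0
    return total / len(p)


if __name__ == "__main__":
    gt = [[1, 1], [0, 0]]
    print(enhanced_alignment(gt, gt))                # perfect prediction
    print(enhanced_alignment([[1, 0], [0, 0]], gt))  # partially correct
```

Under these assumptions, a prediction identical to the GT scores close to 1 and a fully inverted prediction scores close to 0. Because every pixel is compared relative to its map's mean, the score reflects both pixel-level matches and image-level statistics.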