RGB颜色模型
人工智能
计算机视觉
计算机科学
基本事实
材料科学
光学
物理
作者
Tao Yan,Shufan Xu,Hao Huang,Helong Li,Tan Lu,Xiaojun Chang,Rynson W. H. Lau
标识
DOI:10.1016/j.knosys.2024.111722
摘要
Glass surfaces are ubiquitous in human life environment, such as glass windows, glass doors, glass guardrails and glass walls. Most glass surfaces are transparent without intrinsic texture and color characteristics. Such characteristics pose significant challenges for artificial intelligence systems to identify glass surfaces. We observed that reflections on glass surfaces of near-infrared (NIR) images are always significantly suppressed compared with that of regular RGB images captured from the same scene. Thus, we propose an effective glass surface detection network, called NRGlassNet, which takes NIR-RGB image pair captured from the same scene as input. Our NRGlassNet employ a dual-branch structure consisting of powerful Swin-Transformer blocks to extract features from the NIR image and the RGB image separately. We also propose a novel Multi-modal Context Contrast (MCC) module to modulate the differences of reflection intensities in the NIR image and the RGB image for identifying glass surfaces. In addition, for learning our proposed network, we propose a new dataset, called RNGD, which consists of 1378 NIR-RGB image pairs captured from real-world scenes as well as their ground-truth glass surface annotations. Quantitative and qualitative evaluations demonstrate the effectiveness and superiority of our proposed method. Our code and dataset will be available at: https://github.com/YT3DVision/NRGlassNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI