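Pyramid (geometry)
Artificial intelligence
Computer vision
Computer science
Object detection
Robustness (evolution)
Smoothing
Blob detection
Pattern recognition (psychology)
Edge detection
Image processing
Mathematics
Image (mathematics)
Gene
Chemistry
Biochemistry
Geometry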
Authors
Rui Xue,Jialu Duan,Zhengwei Du
Identifier
DOI:10.1016/j.imavis.2024.105202
Abstract
Object detection has broad applications in areas such as autonomous driving, security surveillance, and deep-sea exploration. However, the performance of current detection algorithms degrades significantly under low-light or nighttime conditions because images lose detail, gain noise, and suffer color distortion. To address this problem, we propose a plug-and-play multiscale pyramid enhancement network (MPENet), which is cascaded with RT-DETR to form an end-to-end framework for low-light object detection, named MPE-DETR. First, MPENet uses Gaussian blur to decompose images into Gaussian and Laplacian pyramids at different resolutions. We then design a high-frequency texture enhancement (HTE) module to capture the edge and texture information of images, and a low-frequency noise smoothing (LNS) module to capture the overall image structure and global-scale features. The output features of the HTE and LNS modules are concatenated along the channel dimension to fuse features across scales. We conducted experiments on the ExDark and ExDark + LOD datasets, which are designed for low-light object detection. The results indicate that the proposed method improves mAP@0.5 by 2.1% over existing state-of-the-art (SOTA) models on the ExDark dataset and demonstrates strong generalizability and robustness on the ExDark + LOD dataset. The code and results are available at https://github.com/PZDJL/MPENet.
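The pyramid decomposition mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see their repository for that); it only shows the classical Gaussian/Laplacian pyramid idea on a single-channel image. The function names, the kernel size, the number of levels, and the nearest-neighbor upsampling are all assumptions made for illustration, and `fuse_channels` is a hypothetical stand-in for channel-wise concatenation, not the learned HTE or LNS modules.

```python
import numpy as np


def gaussian_kernel_1d(sigma=1.0, radius=2):
    """Normalized 1D Gaussian kernel (sigma and radius are illustrative choices)."""
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()


def gaussian_blur(img, sigma=1.0, radius=2):
    """Separable Gaussian blur of a 2D array, using reflect padding."""
    k = gaussian_kernel_1d(sigma, radius)
    padded = np.pad(img, radius, mode="reflect")
    # Horizontal pass, then vertical pass; "valid" trims the padding again.
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda c: np.convolve(c, k, mode="valid"), 0, rows)


def upsample(img, shape):
    """Nearest-neighbor 2x upsampling, cropped to the target shape."""
    up = np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)
    return up[: shape[0], : shape[1]]


def build_pyramids(img, levels=3):
    """Return (gaussian, laplacian) pyramids for a 2D image."""
    gaussian = [img.astype(np.float64)]
    for _ in range(levels):
        # Blur, then keep every second pixel to halve the resolution.
        gaussian.append(gaussian_blur(gaussian[-1])[::2, ::2])
    # Each Laplacian level holds the detail lost between two Gaussian levels.
    laplacian = [
        fine - upsample(coarse, fine.shape)
        for fine, coarse in zip(gaussian[:-1], gaussian[1:])
    ]
    return gaussian, laplacian


def reconstruct(laplacian, coarsest):
    """Invert the decomposition: upsample and add details, coarse to fine."""
    img = coarsest
    for lap in reversed(laplacian):
        img = upsample(img, lap.shape) + lap
    return img


def fuse_channels(high, low):
    """Hypothetical fusion: stack a high-frequency map and an upsampled
    low-frequency map along a new leading channel axis."""
    return np.stack([high, upsample(low, high.shape)], axis=0)
```

In this sketch the Laplacian levels carry the high-frequency content (edges and texture), while the coarsest Gaussian level carries the low-frequency structure. Because each Laplacian level is defined as the difference from the same upsampled coarse level that `reconstruct` adds back, the original image is recovered up to floating-point rounding.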