空格(标点符号)
计算机科学
国家(计算机科学)
计算机视觉
人工智能
计算机图形学(图像)
操作系统
算法
作者
Kai Zhao,Shufan Peng,Yujin Li,Tianliang Lu
标识
DOI:10.1038/s41598-025-96035-1
摘要
X-ray image-based prohibited item detection plays a crucial role in modern public security systems. Despite significant advancements in deep learning, challenges such as feature extraction, object occlusion, and model complexity remain. Although recent efforts have utilized larger-scale CNNs or ViT-based architectures to enhance accuracy, these approaches incur substantial trade-offs, including prohibitive computational overhead and practical deployment limitations. To address these issues, we propose Xray-YOLO-Mamba, a lightweight model that integrates the YOLO and Mamba architectures. Key innovations include the CResVSS block, which enhances receptive fields and feature representation; the SDConv downsampling block, which minimizes information loss during feature transformation; and the Dysample upsampling block, which improves resolution recovery during reconstruction. Experimental results demonstrate that the proposed model achieves superior performance across three datasets, exhibiting robust performance and excellent generalization ability. Specifically, our model attains mAP50-95 of 74.6% (CLCXray), 43.9% (OPIXray), and 73.9% (SIXray), while demonstrating lightweight efficiency with 4.3 M parameters and 10.3 GFLOPs. The architecture achieves real-time performance at 95.2 FPS on the GPUs. In summary, Xray-YOLO-Mamba strikes a favorable balance between precision and computational efficiency, demonstrating significant advantages.
科研通智能强力驱动
Strongly Powered by AbleSci AI