作者
Pin-Han Li,Meihua Wang,Zhun Fan,Han Huang,Guijie Zhu,J. C. Zhuang
摘要
Cracks are among the most common, most likely, and earliest of all pavement distresses. Detecting and repairing cracks as early as possible can help extend the service life of pavements. However, Detecting cracks with precision can be challenging due to their varied structural characteristics and complex background interference. In this paper, a new convolutional neural network architecture, OUR-Net, is designed to more efficiently treat both high-and low-frequency visual image features. An Ocatve Convolution is incorporated into the proposed network as an enhancement to conventional convolution. In particular, an Octave Convolution Residual Block (OCRB) is embedded in the encoder to replace the convolutional layer of the classical encoder. Moerover, we propose Octave Max Unpooling (OMU) as the upsampling operation of the decoder, enabling the neural network to learn how to decode multi-spatial frequency features. Compared with models using traditional convolution, OUR-Net has better capability of processing multi-scale information, thus simultaneously improving model performance while saving computational costs by reducing spatial redundancy. We evaluate the superiority of the proposed method by comparing it to state-of-the-art crack segmentation methods on four public datasets (CrackLS315, CFD, Crack200, DeepCrack), which encompass cracks of various widths. Comprehensive experimental results reveal that the proposed method performs excellently, which achieves F1-score and mIoU of 0.9112, 0.9271, 0.8106, 0.9318, and 0.8369, 0.8644, 0.6815, 0.8723, respectively, on the four datasets. A lightweight version of the proposed network is constructed using depthwise separable convolution that achieves excellent performance with only 0.88M parameters.