索贝尔算子
计算机科学
核(代数)
加速
操作员(生物学)
图形处理单元
并行计算
库达
算法
计算科学
计算机图形学(图像)
边缘检测
图像处理
人工智能
图像(数学)
数学
生物化学
化学
抑制因子
转录因子
基因
组合数学
作者
Qiong Chang,Xiang Li,Yun Li,Junichi Miyazaki
标识
DOI:10.1016/j.jpdc.2023.03.004
摘要
Sobel is one of the most popular edge detection operators used in image processing. To date, most users utilize the two-directional 3×3 Sobel operator as detectors because of its low computational cost and reasonable performance. Simultaneously, many studies have been conducted on using large multi-directional Sobel operators to satisfy their needs considering the high stability, but at an expense of speed. This paper proposes a fast graphics processing unit (GPU) kernel for the four-directional 5×5 Sobel operator. To improve kernel performance, we implement the kernel based on warp-level primitives, which can significantly reduce the number of memory accesses. In addition, we introduce the prefetching mechanism and operator transformation into the kernel to significantly reduce the computational complexity and data transmission latency. Compared with the OpenCV-GPU library, our kernel shows high performances of 6.7x speedup on a Jetson AGX Xavier GPU and 13x on a GTX 1650Ti GPU.
科研通智能强力驱动
Strongly Powered by AbleSci AI