计算机科学
Canny边缘检测器
加速
并行计算
软件可移植性
库达
微分边缘检测器
数据并行性
图像渐变
核(代数)
边缘检测
算法
图像(数学)
图像处理
平行性(语法)
人工智能
数学
组合数学
程序设计语言
作者
Yupu Song,Cailin Li,Qinglei Zhou,Han Xiao
出处
期刊:Research Square - Research Square
日期:2023-04-06
标识
DOI:10.21203/rs.3.rs-2774366/v1
摘要
Abstract In the process of Canny edge detection, a large number of high complexity calculations such as Gaussian filtering, gradient calculation, non-maximum suppression, and double threshold judgment need to be performed on the image, which takes up a lot of operation time, which is a great challenge to the real-time requirements of the algorithm. In order to solve this problem, a fine-grained parallel Canny edge detection method is proposed, which is optimized from three aspects: task partition, vector memory access, and NDRange optimization, and CPU-GPU collaborative parallelism is realized. At the same time, the parallel Canny edge detection methods based on multi-core CPU and CUDA architecture are designed. The experimental results show that OpenCL accelerated Canny edge detection algorithm can achieve 20.68 times, 3.96 times, and 1.21 times speedup ratio compared with CPU serial algorithm, CPU multi-threaded parallel algorithm, and CUDA-based parallel algorithm, respectively. The effectiveness and performance portability of the proposed Canny edge detection parallel algorithm are verified, and it provides a reference for the research of fast calculation of image big data.
科研通智能强力驱动
Strongly Powered by AbleSci AI