计算机科学
并行计算
多核处理器
离散化
库达
流式处理
图形处理单元的通用计算
计算科学
执行时间
建筑
绘图
计算机图形学(图像)
数学
数学分析
艺术
视觉艺术
作者
Qi-Kun Xue,Yang Wang,Xiangyun Chang
标识
DOI:10.3997/2214-4609.201413105
摘要
Reverse time migration (RTM) based on the full wave equation discretization are tools of major importance because they give an accurate representation of complex wave propagation areas. However, these applications impose high computational requirements, thus efficient parallelization of the RTM algorithm is crucial. In this paper, we present a multi-stream optimizing approach for the RTM algorithm on the latest Kepler architecture GPU. Our approach take full advantage of the Hyper-Q new feature of the Kepler architecture GPU so that multiple streams generated from a quantity of OpenMP threads operating on the CPU multicore processor can be rapidly running on massive fine-grain GPU processing cores. Experimental results show that our approach 17% performance improvement to the single stream implementation on Nvidia Tesla K40 GPU.
科研通智能强力驱动
Strongly Powered by AbleSci AI