计算机科学
并行计算
平行性(语法)
算法
指令级并行
数据并行性
出处
期刊:Pacific Rim Conference on Communications, Computers and Signal Processing
日期:2019-08-01
卷期号:: 1-6
标识
DOI:10.1109/pacrim47961.2019.8985073
摘要
Memory level parallelism (MLP), which refers to the number of memory requests concurrently held by Miss Status Handling Registers (MSHRs), is an indispensable factor to estimate cache performance. Unfortunately, due to the complexity of obtaining the maximum number of memory instructions or cache misses among all dependence paths, previous works in MLP modeling are very time-consuming in trace profiling. In this paper, we propose a fast model for evaluating MLP without analyzing all dependence paths in the instruction windows. Instead, we construct a probability model to estimate the maximum number of cache misses among all dependence paths in the instruction windows fed with some easily obtained inputs. By doing so, we greatly reduce the time overhead of the MLP model with a slight drop in accuracy. Twelve benchmarks chosen from Spec2006 are adopted for evaluating the accuracy and time overhead of our model. The average error of our model compared with the results of Gem5 simulations is around 9%, while the time overhead of the MLP evaluation process can be decreased to about half of previous models.
科研通智能强力驱动
Strongly Powered by AbleSci AI