计算机科学
硬件加速
计算
杠杆(统计)
软件
加速度
计算机工程
计算科学
并行计算
张量(固有定义)
钥匙(锁)
计算机体系结构
人工智能
算法
程序设计语言
计算机安全
数学
经典力学
物理
纯数学
作者
Shail Dave,Riyadh Baghdadi,Tony Nowatzki,Sasikanth Avancha,Aviral Shrivastava,Baoxin Li
出处
期刊:Proceedings of the IEEE
[Institute of Electrical and Electronics Engineers]
日期:2021-10-01
卷期号:109 (10): 1706-1752
被引量:54
标识
DOI:10.1109/jproc.2021.3098483
摘要
Machine learning (ML) models are widely used in many important domains. For efficiently processing these computational- and memory-intensive applications, tensors of these overparameterized models are compressed by leveraging sparsity, size reduction, and quantization of tensors. Unstructured sparsity and tensors with varying dimensions yield irregular computation, communication, and memory access patterns; processing them on hardware accelerators in a conventional manner does not inherently leverage acceleration opportunities. This article provides a comprehensive survey on the efficient execution of sparse and irregular tensor computations of ML models on hardware accelerators. In particular, it discusses enhancement modules in the architecture design and the software support, categorizes different hardware designs and acceleration techniques, analyzes them in terms of hardware and execution costs, analyzes achievable accelerations for recent DNNs, and highlights further opportunities in terms of hardware/software/model codesign optimizations (inter/intramodule). The takeaways from this article include the following: understanding the key challenges in accelerating sparse, irregular shaped, and quantized tensors; understanding enhancements in accelerator systems for supporting their efficient computations; analyzing tradeoffs in opting for a specific design choice for encoding, storing, extracting, communicating, computing, and load-balancing the nonzeros; understanding how structured sparsity can improve storage efficiency and balance computations; understanding how to compile and map models with sparse tensors on the accelerators; and understanding recent design trends for efficient accelerations and further opportunities.
科研通智能强力驱动
Strongly Powered by AbleSci AI