计算机科学
计算机体系结构
精简计算指令集
指令集
硬件加速
嵌入式系统
建筑
钥匙(锁)
灵活性(工程)
计算机工程
计算机硬件
现场可编程门阵列
操作系统
艺术
视觉艺术
统计
数学
作者
Harini Sriraman,Aswathy Ravikumar,Dhruv Garg
出处
期刊:Lecture notes on data engineering and communications technologies
日期:2020-12-21
卷期号:: 287-300
被引量:2
标识
DOI:10.1007/978-981-15-8767-2_25
摘要
Developing prototypes for systems with custom chips is the most recent advancement in accelerator-centric architectures. The development of such prototype is a challenging task but with the emergence of new open-source software's and hardware's help in addressing this challenge to an extent by reducing implementation, design and effort. With increasing utility of Artificial Intelligence (AI) in various applications, there are great benefits to be reaped from faster training and computation of neural networks. The objective of this research is to develop an open-source architecture for acceleration of AI applications. We studied in detail the existing and available AI accelerators, and understood their key features. Their key functionalities were incorporated in the design of the VeNNus processor. This paper describes the architecture of the VeNNus processor, which uses the RISC-V Instruction Set Architecture (ISA). RISC-V ISA offers benefits such as flexibility, lower costs and high efficiency. However, to further improve the performance and energy efficiency of our Artificial Intelligence (AI) accelerator, 16 custom vector instructions were added in extension to the RISC-V ISA. The VeNNus processor includes redundant Arithmetic-Logic Units (ALUs), and uses quantization of training weights to 8-bit integers to deliver better performance, driving higher throughput. With the processor using vector instructions and quantization, we anticipate good acceleration of deep neural networks. The vital benefits and challenges faced with the RISC-V instruction set are described in this paper.
科研通智能强力驱动
Strongly Powered by AbleSci AI