计算机科学
静态随机存取存储器
CMOS芯片
并行计算
计算机硬件
现场可编程门阵列
量化(信号处理)
延迟(音频)
MIMD
算法
电子工程
工程类
电信
作者
Kodai Ueyoshi,Kota Ando,Kazutoshi Hirose,Shinya Takamaeda-Yamazaki,Mototsugu Hamada,Tadahiro Kuroda,Masato Motomura
出处
期刊:IEEE Journal of Solid-state Circuits
[Institute of Electrical and Electronics Engineers]
日期:2018-10-15
卷期号:54 (1): 186-196
被引量:47
标识
DOI:10.1109/jssc.2018.2871623
摘要
QUEST is a programmable multiple instruction, multiple data (MIMD) parallel accelerator for general-purpose state-of-the-art deep neural networks (DNNs). It features die-to-die stacking with three-cycle latency, 28.8 GB/s, 96 MB, and eight SRAMs using an inductive coupling technology called the ThruChip interface (TCI). By stacking the SRAMs instead of DRAMs, lower memory access latency and simpler hardware are expected. This facilitates in balancing the memory capacity, latency, and bandwidth, all of which are in demand by cutting-edge DNNs at a high level. QUEST also introduces log-quantized programmable bit-precision processing for achieving faster (larger) DNN computation (size) in a 3-D module. It can sustain higher recognition accuracy at a lower bitwidth region compared to linear quantization. The prototype QUEST chip is integrated in the 40-nm CMOS technology, and it achieves 7.49 tera operations per second (TOPS) peak performance in binary precision, and 1.96 TOPS in 4-bit precision at 300-MHz clock.
科研通智能强力驱动
Strongly Powered by AbleSci AI