推论
计算机科学
热的
航程(航空)
功率(物理)
可靠性(半导体)
人工神经网络
深层神经网络
符号
晶体管
计算机工程
计算科学
电子工程
电压
电气工程
数学
材料科学
人工智能
算术
物理
工程类
气象学
复合材料
量子力学
作者
Georgios Zervakis,Iraklis Anagnostopoulos,Sami Salamin,Ourania Spantidi,Isai Roman-Ballesteros,Jörg Henkel,Hussam Amrouch
标识
DOI:10.1109/tc.2022.3141054
摘要
Recent breakthroughs in Neural Networks (NNs) have made DNN accelerators ubiquitous and led to an ever-increasing quest on adopting them from Cloud to edge computing. However, state-of-the-art DNN accelerators pack immense computational power in a relatively confined area, inducing significant on-chip power densities that lead to intolerable thermal bottlenecks. Existing state of the art focuses on using approximate multipliers only to trade-off efficiency with inference accuracy. In this work, we present a thermal-aware approximate DNN accelerator design in which we additionally trade-off approximation with temperature effects towards designing DNN accelerators that satisfy tight temperature constraints. Using commercial multi-physics tool flows for heat simulations, we demonstrate how our thermal-aware approximate design reduces the temperature from 139 $^{\circ }$ C, in an accurate circuit, down to 79 $^{\circ }$ C. This enables DNN accelerators to fulfill tight thermal constraints, while still maximizing the performance and reducing the energy by around 75% with a negligible accuracy loss of merely 0.44% on average for a wide range of NN models. Furthermore, using physics-based transistor aging models, we demonstrate how reductions in voltage and temperature obtained by our approximate design considerably improve the circuit’s reliability. Our approximate design exhibits around 40% less aging-induced degradation compared to the baseline design.
科研通智能强力驱动
Strongly Powered by AbleSci AI