计算机科学
微控制器
嵌入式系统
ARM体系结构
固件
边缘计算
边缘设备
人工神经网络
多核处理器
高效能源利用
计算机硬件
并行计算
人工智能
云计算
操作系统
物联网
工程类
电气工程
作者
Xiaying Wang,Michele Magno,Lukas Cavigelli,Luca Benini
标识
DOI:10.1109/jiot.2020.2976702
摘要
The growing number of low-power smart devices in the Internet of Things is coupled with the concept of "edge computing" that is moving some of the intelligence, especially machine learning, toward the edge of the network. Enabling machine learning algorithms to run on resource-constrained hardware, typically on low-power smart devices, is challenging in terms of hardware (optimized and energy-efficient integrated circuits), algorithmic, and firmware implementations. This article presents a FANN-on-MCU, an open-source toolkit built upon the fast artificial neural network (FANN) library to run lightweight and energy-efficient neural networks on microcontrollers based on both the ARM Cortex-M series and the novel RISC-V-based parallel ultralow-power (PULP) platform. The toolkit takes multilayer perceptrons trained with FANN and generates code targeted to low-power microcontrollers. This article also presents detailed analyses of energy efficiency across the different cores, and the optimizations to handle different network sizes. Moreover, it provides a detailed analysis of parallel speedups and degradations due to parallelization overhead and memory transfers. Further evaluations include experimental results for three different applications using a self-sustainable wearable multisensor bracelet. The experimental results show a measured latency in the order of only a few microseconds and power consumption of a few milliwatts while keeping the memory requirements below the limitations of the targeted microcontrollers. In particular, the parallel implementation on the octa-core RISC-V platform reaches a speedup of 22× and a 69% reduction in energy consumption with respect to a single-core implementation on Cortex-M4 for continuous real-time classification.
科研通智能强力驱动
Strongly Powered by AbleSci AI