FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things

计算机科学微控制器嵌入式系统 ARM体系结构固件边缘计算边缘设备人工神经网络多核处理器高效能源利用计算机硬件并行计算人工智能云计算操作系统物联网工程类电气工程

作者

Xiaying Wang,Michele Magno,Lukas Cavigelli,Luca Benini

出处

期刊：IEEE Internet of Things Journal [Institute of Electrical and Electronics Engineers]
日期：2020-02-27 卷期号：7 (5): 4403-4417 被引量：134

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.1109/jiot.2020.2976702

摘要

The growing number of low-power smart devices in the Internet of Things is coupled with the concept of "edge computing" that is moving some of the intelligence, especially machine learning, toward the edge of the network. Enabling machine learning algorithms to run on resource-constrained hardware, typically on low-power smart devices, is challenging in terms of hardware (optimized and energy-efficient integrated circuits), algorithmic, and firmware implementations. This article presents a FANN-on-MCU, an open-source toolkit built upon the fast artificial neural network (FANN) library to run lightweight and energy-efficient neural networks on microcontrollers based on both the ARM Cortex-M series and the novel RISC-V-based parallel ultralow-power (PULP) platform. The toolkit takes multilayer perceptrons trained with FANN and generates code targeted to low-power microcontrollers. This article also presents detailed analyses of energy efficiency across the different cores, and the optimizations to handle different network sizes. Moreover, it provides a detailed analysis of parallel speedups and degradations due to parallelization overhead and memory transfers. Further evaluations include experimental results for three different applications using a self-sustainable wearable multisensor bracelet. The experimental results show a measured latency in the order of only a few microseconds and power consumption of a few milliwatts while keeping the memory requirements below the limitations of the targeted microcontrollers. In particular, the parallel implementation on the octa-core RISC-V platform reaches a speedup of 22× and a 69% reduction in energy consumption with respect to a single-core implementation on Cortex-M4 for continuous real-time classification.

求助该文献

最长约 10秒，即可获得该文献文件

FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things

今日热心研友