计算机科学
推论
帧(网络)
集合(抽象数据类型)
人工智能
机器学习
机器视觉
选择(遗传算法)
面子(社会学概念)
社会科学
电信
社会学
程序设计语言
作者
Basar Kutukcu,Sabur Baidya,Anand Raghunathan,Sujit Dey
标识
DOI:10.1109/aicas51828.2021.9458468
摘要
Real-time machine vision applications running on resource-constrained embedded systems face challenges for maintaining performance. An especially challenging scenario arises when multiple applications execute at the same time, creating contention for the computational resources of the system. This contention results in increase in inference delay of the machine vision applications which can be unacceptable for time-critical tasks. To address this challenge, we propose an adaptive model selection framework to mitigate the impact of system contention and prevent unexpected increase in inference delay by trading off the application accuracy minimally. The framework uses a set of hierarchical deep learning models for image classification. It predicts the inference delays of each model and selects the optimal model for each frame considering the system contention. Compared to a fixed individual model with similar accuracy, our framework improves the performance by significantly reducing the inference delay violations against a practical threshold. We implement our framework on Nvidia Jetson TX2 and show that our approach achieves a gain over the individual model by 27.6% reductions in delay violations.
科研通智能强力驱动
Strongly Powered by AbleSci AI