计算机科学
串联(数学)
深度学习
人工智能
特征(语言学)
管道(软件)
目标检测
卷积神经网络
卷积(计算机科学)
边缘设备
特征学习
模式识别(心理学)
转化(遗传学)
计算机视觉
人工神经网络
数学
组合数学
哲学
操作系统
基因
化学
程序设计语言
云计算
生物化学
语言学
标识
DOI:10.1016/j.patcog.2023.109451
摘要
We introduce YOGA, a deep learning based yet lightweight object detection model that can operate on low-end edge devices while still achieving competitive accuracy. The YOGA architecture consists of a two-phase feature learning pipeline with a cheap linear transformation, which learns feature maps using only half of the convolution filters required by conventional convolutional neural networks. In addition, it performs multi-scale feature fusion in its neck using an attention mechanism instead of the naive concatenation used by conventional detectors. YOGA is a flexible model that can be easily scaled up or down by several orders of magnitude to fit a broad range of hardware constraints. We evaluate YOGA on COCO-val and COCO-testdev datasets with other over 10 state-of-the-art object detectors. The results show that YOGA strikes the best trade-off between model size and accuracy (up to 22% increase of AP and 23-34% reduction of parameters and FLOPs), making it an ideal choice for deployment in the wild on low-end edge devices. This is further affirmed by our hardware implementation and evaluation on NVIDIA Jetson Nano.
科研通智能强力驱动
Strongly Powered by AbleSci AI