计算机科学
目标检测
推论
建筑
管道(软件)
人工智能
延迟(音频)
深度学习
修剪
量化(信号处理)
调度(生产过程)
机器学习
人工神经网络
实时计算
计算机视觉
模式识别(心理学)
工程类
艺术
电信
运营管理
农学
视觉艺术
生物
程序设计语言
作者
Maying Shen,Lei Mao,Joshua Chen,Justin Hsu,Xinglong Sun,Oliver Knieps,Carmen Maxim,José M. Alvarez
标识
DOI:10.1109/iv55152.2023.10186732
摘要
3D Object detection is a fundamental task in vision-based autonomous driving. Deep learning perception models achieve an outstanding performance at the expense of continuously increasing resource needs and, as such, increasing training costs. As inference time is still a priority, developers usually adopt a training pipeline where they first start using a compact architecture that yields a good trade-off between accuracy and latency. This architecture is usually found either by searching manually or by using neural architecture search approaches. Then, train the model and use light optimization techniques such as quantization to boost the model’s performance. In contrast, in this paper, we advocate for starting on a much larger model and then applying aggressive optimization to adapt the model to the resource-constraints. Our results on large-scale settings for 3D object detection demonstrate the benefits of initially focusing on maximizing the model’s accuracy and then achieving the latency requirements using network pruning.
科研通智能强力驱动
Strongly Powered by AbleSci AI