计算机科学
计算机视觉
人工智能
延迟(音频)
RGB颜色模型
帧速率
智能摄像头
低延迟(资本市场)
图像传感器
利用
事件(粒子物理)
实时计算
电信
物理
量子力学
计算机网络
计算机安全
作者
Daniel Gehrig,Davide Scaramuzza
出处
期刊:Nature
[Springer Nature]
日期:2024-05-29
卷期号:629 (8014): 1034-1040
标识
DOI:10.1038/s41586-024-07409-w
摘要
Abstract The computer vision algorithms used currently in advanced driver assistance systems rely on image-based RGB cameras, leading to a critical bandwidth–latency trade-off for delivering safe driving experiences. To address this, event cameras have emerged as alternative vision sensors. Event cameras measure the changes in intensity asynchronously, offering high temporal resolution and sparsity, markedly reducing bandwidth and latency requirements 1 . Despite these advantages, event-camera-based algorithms are either highly efficient but lag behind image-based ones in terms of accuracy or sacrifice the sparsity and efficiency of events to achieve comparable results. To overcome this, here we propose a hybrid event- and frame-based object detector that preserves the advantages of each modality and thus does not suffer from this trade-off. Our method exploits the high temporal resolution and sparsity of events and the rich but low temporal resolution information in standard images to generate efficient, high-rate object detections, reducing perceptual and computational latency. We show that the use of a 20 frames per second (fps) RGB camera plus an event camera can achieve the same latency as a 5,000-fps camera with the bandwidth of a 45-fps camera without compromising accuracy. Our approach paves the way for efficient and robust perception in edge-case scenarios by uncovering the potential of event cameras 2 .
科研通智能强力驱动
Strongly Powered by AbleSci AI