计算机科学
预测(人工智能)
背景(考古学)
图形
人工智能
编码(集合论)
机器学习
理论计算机科学
生物
古生物学
集合(抽象数据类型)
程序设计语言
作者
Wenfeng Song,Shuai Li,Tao Chang,Ke Xie,Aimin Hao,Hong Qin
标识
DOI:10.1016/j.patcog.2023.110071
摘要
Accident anticipation (or the prediction of abnormal events in general) aims to forecast accidents before they occur by assessing risks based on the preceding frames in videos. The risk assessment heavily relies on understanding the semantics of the scene context and predicting the interactions among the involved subjects. Indeed, the comprehensive utilization of spatial relationships among the subjects of immediate interest in a single frame and temporal dependencies across consecutive frames is crucial for video accident anticipation. To address this challenge, we propose a novel approach called Dynamic Attention Augmented Graph Network (DAA-GNN), which leverages underlying spatial cues and models relationships among detected subjects of immediate interest. Specifically, our approach employs a graph neural network that is enhanced by global context clues, allowing for effective message propagation and the discovery of interactions among the subjects of interest in the scene. The DAA-GNN includes a temporal attention module designed to identify long-term dependencies along the temporal axis, contributing to an end-to-end deep network solution for accurate accident anticipation. We extensively evaluate our method on the publicly-available Dashcam Accident Dataset (DAD) and Epic Fail (EF) datasets, conducting comprehensive experiments to assess its performance. The results unequivocally demonstrate that our method outperforms the state-of-the-art accident anticipation methods. Our code and datasets will be made publicly available to facilitate future research and reproducibility.
科研通智能强力驱动
Strongly Powered by AbleSci AI