计算机科学
透视图(图形)
人工智能
对象(语法)
期限(时间)
平面图(考古学)
场景统计
像素
计算机视觉
机器学习
感知
物理
历史
考古
神经科学
生物
量子力学
作者
Xiaotian Qiao,Quanlong Zheng,Ying Cao,Rynson W. H. Lau
标识
DOI:10.1007/s11263-021-01560-x
摘要
Forecasting scene layout is of vital importance in many vision applications, e.g., enabling autonomous vehicles to plan actions early. It is a challenging problem as it involves understanding of the past scene layouts and the diverse object interactions in the scene, and then forecasting what the scene will look like at a future time. Prior works learn a direct mapping from past pixels to future pixel-wise labels and ignore the underlying object interactions in the scene, resulting in temporally incoherent and averaged predictions. In this paper, we propose a learning framework to forecast semantic scene layouts (represented by instance maps) from an instance-aware perspective. Specifically, our framework explicitly models the dynamics of individual instances and captures their interactions in a scene. Under this formulation, we are able to enforce instance-level constraints to forecast scene layouts by effectively reasoning about their spatial and semantic relations. Experimental results show that our model can predict sharper and more accurate future instance maps than the baselines and prior methods, yielding state-of-the-art performances on short-term, mid-term and long-term scene layout forecasting.
科研通智能强力驱动
Strongly Powered by AbleSci AI