计算机科学
启发式
事件(粒子物理)
生成语法
集合(抽象数据类型)
人工智能
水准点(测量)
机器学习
自然语言
代表(政治)
自然语言理解
基线(sea)
任务(项目管理)
自然语言处理
数据科学
物理
大地测量学
量子力学
政治
政治学
法学
程序设计语言
地理
操作系统
海洋学
管理
地质学
经济
作者
Jiayuan Mao,Xuelin Yang,Xikun Zhang,Noah D. Goodman,Jiajun Wu
出处
期刊:Cornell University - arXiv
日期:2023-01-01
被引量:5
标识
DOI:10.48550/arxiv.2310.03635
摘要
Building machines that can reason about physical events and their causal relationships is crucial for flexible interaction with the physical world. However, most existing physical and causal reasoning benchmarks are exclusively based on synthetically generated events and synthetic natural language descriptions of causal relationships. This design brings up two issues. First, there is a lack of diversity in both event types and natural language descriptions; second, causal relationships based on manually-defined heuristics are different from human judgments. To address both shortcomings, we present the CLEVRER-Humans benchmark, a video reasoning dataset for causal judgment of physical events with human labels. We employ two techniques to improve data collection efficiency: first, a novel iterative event cloze task to elicit a new representation of events in videos, which we term Causal Event Graphs (CEGs); second, a data augmentation technique based on neural language generative models. We convert the collected CEGs into questions and answers to be consistent with prior work. Finally, we study a collection of baseline approaches for CLEVRER-Humans question-answering, highlighting the great challenges set forth by our benchmark.
科研通智能强力驱动
Strongly Powered by AbleSci AI