Benchmarking dynamic neural-network models of the human speed-accuracy tradeoff

计算机科学水准点（测量）延迟（音频）人工神经网络计算人工智能航程（航空）算法模式识别（心理学）大地测量学电信复合材料材料科学地理

作者

Ajay Subramanian,Elena Sizikova,Omkar Kumbhar,Najib J. Majaj,Denis G. Pelli

出处

期刊：Journal of Vision [Association for Research in Vision and Ophthalmology]
日期：2022-12-05 卷期号：22 (14): 4359-4359 被引量：2

链接

doi.orgdoi.org

标识

DOI：10.1167/jov.22.14.4359

摘要

People take a variable amount of time (0.1 - 10 s) to recognize an object and can trade speed for accuracy. Various time-constrained tasks demand a wide range of accuracy and latency. Previous work (Spoerer’20) has modeled only modest speed-accuracy tradeoffs (SATs) with a min-to-max range of merely 6% accuracy and 200 ms reaction time, a tiny fraction of the human range. Here, we collect and present a public human benchmark where we use image perturbations to adjust task difficulty and increase the accuracy range to more than 50%. Furthermore, we show that dynamic neural networks are a promising model of the SAT and capture the behavior without needing recurrence. 142 online participants categorized CIFAR-10 images with controlled reaction time. Reaction time (RT) was defined as the elapsed time between stimulus presentation and a keypress response. We ran 5 blocks of 300 trials, each with a different reaction time from 200-1000 ms and repeated the experiment with 4 different viewing conditions: color, grayscale, noise, and blur. Three networks: MSDNet (Huang’17), SCAN (Zhang’19), and ConvRNN (Spoerer’20) were trained on CIFAR-10 image classification. Using FLOPs as an analogue for human reaction time, we tested these networks by forcing them to “respond” using different amounts of computation, across all viewing conditions. We compared the three networks and humans using two metrics: accuracy range (difference between maximum and minimum accuracy when reaction time is varied) and correlation between speed-accuracy trade-off curves. MSDNet gives a better account than previous attempts without needing recurrence. When trained with noise, it shows high correlation (0.93) with human SAT. However, humans are much more flexible, with a large 51% accuracy range while the best network, MSDNet trained with noise, shows only 19%. Thus, our benchmark presents a challenging goal for future work that aims to model SAT.

求助该文献

Benchmarking dynamic neural-network models of the human speed-accuracy tradeoff

今日热心研友