摄影
机器视觉
计算机科学
人工智能
计算摄影
管道(软件)
视觉科学
生成语法
快照(计算机存储)
计算机视觉
视觉艺术
图像处理
图像(数学)
艺术
程序设计语言
操作系统
作者
nicolas maleve,Katrina Sluis
标识
DOI:10.1215/2834703x-10734066
摘要
Abstract Despite computer vision's extensive mobilization of cameras, photographers, and viewing subjects, photography's place in machine vision remains undertheorized. This article illuminates an operative theory of photography that exists in a latent form, embedded in the tools, practices, and discourses of machine vision research and enabling the methodological imperatives of dataset production. Focusing on the development of the canonical object recognition dataset ImageNet, the article analyzes how the dataset pipeline translates the radical polysemy of the photographic image into a stable and transparent form of data that can be portrayed as a proxy of human vision. Reflecting on the prominence of the photographic snapshot in machine vision discourse, the article traces the path that made this popular cultural practice amenable to the dataset. Following the evolution from nineteenth-century scientific photography to the acquisition of massive sets of online photos, the article shows how dataset creators inherit and transform a form of “instrumental realism,” a photographic enterprise that aims to establish a generalized look from contingent instances in the pursuit of statistical truth. The article concludes with a reflection on how the latent photographic theory of machine vision we have advanced relates to the large image models built for generative AI today.
科研通智能强力驱动
Strongly Powered by AbleSci AI