计算机科学
降噪
分割
推论
噪音(视频)
编码(集合论)
人工智能
过程(计算)
模式识别(心理学)
图像去噪
扩散
图像分割
图像(数学)
算法
程序设计语言
物理
热力学
集合(抽象数据类型)
作者
Zhangxuan Gu,Haoxing Chen,Zhuoer Xu
标识
DOI:10.1109/icassp48485.2024.10447191
摘要
Diffusion frameworks have achieved comparable performance with previous state-of-the-art image generation models. This paper proposes DiffusionInst, a novel framework representing instances as vectors and formulates instance segmentation as a noise-to-vector denoising process. The model is trained to reverse the noisy groundtruth mask without any inductive bias from RPN. It takes a randomly generated vector as input and outputs mask with multi-step denoising during inference. Extensive experimental results on COCO and LVIS show that DiffusionInst achieves competitive performance. Our code is available at https://github.com/chenhaoxing/DiffusionInst.
科研通智能强力驱动
Strongly Powered by AbleSci AI