计算机科学
分割
边距(机器学习)
四叉树
人工智能
图像分割
像素
编码(集合论)
模式识别(心理学)
计算机视觉
机器学习
集合(抽象数据类型)
程序设计语言
作者
Ke Lei,Martin Danelljan,Xia Li,Yu‐Wing Tai,Chi-Keung Tang,Fisher Yu
标识
DOI:10.1109/cvpr52688.2022.00437
摘要
Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation. Instead of operating on regular dense tensors, our Mask Transfiner decomposes and represents the image regions as a quadtree. Our transformer-based approach only processes detected error-prone tree nodes and self-corrects their errors in parallel. While these sparse pixels only constitute a small proportion of the total number, they are critical to the final mask quality. This allows Mask Transfiner to predict highly accurate instance masks, at a low computational cost. Extensive experiments demonstrate that Mask Transfiner outperforms current instance segmentation methods on three popular benchmarks, significantly improving both two-stage and query-based frameworks by a large margin of +3.0 mask AP on COCO and BDD100K, and +6.6 boundary AP on Cityscapes. Our code and trained models are available at https://github.com/SysCV/transfiner.
科研通智能强力驱动
Strongly Powered by AbleSci AI