计算机科学
分割
人工智能
模式识别(心理学)
采样(信号处理)
图像分割
数据挖掘
计算机视觉
滤波器(信号处理)
作者
Jevgenij Gamper,Navid Alemi Koohbanani,Ksenija Benet,Ali Khuram,Nasir Rajpoot
标识
DOI:10.1007/978-3-030-23937-4_2
摘要
In this work we present an experimental setup to semi automatically obtain exhaustive nuclei labels across 19 different tissue types, and therefore construct a large pan-cancer dataset for nuclei instance segmentation and classification, with minimal sampling bias. The dataset consists of 455 visual fields, of which 312 are randomly sampled from more than 20K whole slide images at different magnifications, from multiple data sources. In total the dataset contains 216.4K labeled nuclei, each with an instance segmentation mask. We independently pursue three separate streams to create the dataset: detection, classification, and instance segmentation by ensembling in total 34 models from already existing, public datasets, therefore showing that the learnt knowledge can be efficiently transferred to create new datasets. All three streams are either validated on existing public benchmarks or validated by expert pathologists, and finally merged and validated once again to create a large, comprehensive pan-cancer nuclei segmentation and detection dataset PanNuke.
科研通智能强力驱动
Strongly Powered by AbleSci AI