计算机科学
分级(工程)
眼底摄影
眼底(子宫)
人工智能
摄影
糖尿病性视网膜病变
领域(数学)
计算机视觉
眼科
医学
数学
艺术
土木工程
视力
荧光血管造影
纯数学
工程类
视觉艺术
糖尿病
内分泌学
作者
Junlin Hou,Jilan Xu,Fan Xiao,Ran Zhao,Yuejie Zhang,Haidong Zou,Lina Lu,Wenwen Xue,Rui Feng
标识
DOI:10.1109/bibm55620.2022.9995459
摘要
Automatic diabetic retinopathy (DR) grading based on fundus photography has been widely explored to benefit the routine screening and early treatment. Existing researches generally focus on single-field fundus images, which have limited field of view for precise eye examinations. In clinical applications, ophthalmologists adopt two-field fundus photography as the dominating tool, where the information from each field (i.e., macula-centric and optic disc-centric) is highly correlated and complementary, and benefits comprehensive decisions. However, automatic DR grading based on two-field fundus photography remains a challenging task due to the lack of publicly available datasets and effective fusion strategies. In this work, we first construct a new benchmark dataset (DRTiD) for DR grading, consisting of 3,100 two-field fundus images. To the best of our knowledge, it is the largest public DR dataset with diverse and high-quality two-field images. Then, we propose a novel DR grading approach, namely Cross-Field Transformer (CrossFiT), to capture the correspondence between two fields as well as the long-range spatial correlations within each field. Considering the inherent two-field geometric constraints, we particularly define aligned position embeddings to preserve relative consistent position in fundus. Besides, we perform masked cross-field attention during interaction to filter the noisy relations between fields. Extensive experiments on our DRTiD dataset and a public DeepDRiD dataset demonstrate the effectiveness of our CrossFiT network. The new dataset and the source code of CrossFiT will be publicly available at https://github.com/DU-VTS/DRTiD.
科研通智能强力驱动
Strongly Powered by AbleSci AI