计算机科学
人工智能
失真(音乐)
能见度
质量(理念)
视频质量
计算机视觉
图像质量
主观视频质量
可视化
透视图(图形)
特征提取
亮度
模式识别(心理学)
图像(数学)
放大器
公制(单位)
带宽(计算)
经济
光学
哲学
物理
认识论
计算机网络
运营管理
作者
Guanghui Yue,Lixin Zhang,Jingfeng Du,Tianwei Zhou,Wei Zhou,Weisi Lin
标识
DOI:10.1109/tmi.2024.3461737
摘要
Captured colonoscopy videos usually suffer from multiple real-world distortions, such as motion blur, low brightness, abnormal exposure, and object occlusion, which impede visual interpretation. However, existing works mainly investigate the impacts of synthesized distortions, which differ from real-world distortions greatly. This research aims to carry out an in-depth study for colonoscopy Video Quality Assessment (VQA). In this study, we advance this topic by establishing both subjective and objective solutions. Firstly, we collect 1,000 colonoscopy videos with typical visual quality degradation conditions in practice and construct a multi-attribute VQA database. The quality of each video is annotated by subjective experiments from five distortion attributes (i.e., temporal-spatial visibility, brightness, specular reflection, stability, and utility), as well as an overall perspective. Secondly, we propose a Distortion Attribute Reasoning Network (DARNet) for automatic VQA. DARNet includes two streams to extract features related to spatial and temporal distortions, respectively. It adaptively aggregates the attribute-related features through a multi-attribute association module to predict the quality score of each distortion attribute. Motivated by the observation that the rating behaviors for all attributes are different, a behavior guided reasoning module is further used to fuse the attribute-aware features, resulting in the overall quality. Experimental results on the constructed database show that our DARNet correlates well with subjective ratings and is superior to nine state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI