模块化设计
计算机科学
质量(理念)
质量评定
视频质量
可靠性工程
评价方法
工程类
运营管理
操作系统
公制(单位)
哲学
认识论
作者
Wen Wen,Mu Li,Yabin Zhang,Yiting Liao,Junlin Li,Li Zhang,Kede Ma
出处
期刊:Cornell University - arXiv
日期:2024-02-29
标识
DOI:10.48550/arxiv.2402.19276
摘要
Blind video quality assessment (BVQA) plays a pivotal role in evaluating and improving the viewing experience of end-users across a wide range of video-based platforms and services. Contemporary deep learning-based models primarily analyze the video content in its aggressively downsampled format, while being blind to the impact of actual spatial resolution and frame rate on video quality. In this paper, we propose a modular BVQA model, and a method of training it to improve its modularity. Specifically, our model comprises a base quality predictor, a spatial rectifier, and a temporal rectifier, responding to the visual content and distortion, spatial resolution, and frame rate changes on video quality, respectively. During training, spatial and temporal rectifiers are dropped out with some probabilities so as to make the base quality predictor a standalone BVQA model, which should work better with the rectifiers. Extensive experiments on both professionally-generated content and user generated content video databases show that our quality model achieves superior or comparable performance to current methods. Furthermore, the modularity of our model offers a great opportunity to analyze existing video quality databases in terms of their spatial and temporal complexities. Last, our BVQA model is cost-effective to add other quality-relevant video attributes such as dynamic range and color gamut as additional rectifiers.
科研通智能强力驱动
Strongly Powered by AbleSci AI