Video Quality Assessment


Video quality assessment is a computer vision task aiming to mimic video-based human subjective perception. The goal is to produce a MOS score, where a higher score indicates better perceptual quality. Some well-known benchmarks for this task are KoNViD-1k, LIVE VQC, YouTube UGC, and LSVQ. SROCC/PLCC/RMSE are usually used to evaluate the performance of different models.

XEmoGPT: An Explainable Multimodal Emotion Recognition Framework with Cue-Level Perception and Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

RISE-Video: Can Video Generators Decode Implicit World Rules?

Add code
Feb 05, 2026
Viaarxiv icon

Audit After Segmentation: Reference-Free Mask Quality Assessment for Language-Referred Audio-Visual Segmentation

Add code
Feb 03, 2026
Viaarxiv icon

SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM

Add code
Feb 03, 2026
Viaarxiv icon

How well can VLMs rate audio descriptions: A multi-dimensional quantitative assessment framework

Add code
Feb 01, 2026
Viaarxiv icon

Omni-Judge: Can Omni-LLMs Serve as Human-Aligned Judges for Text-Conditioned Audio-Video Generation?

Add code
Feb 02, 2026
Viaarxiv icon

Unified Personalized Reward Model for Vision Generation

Add code
Feb 02, 2026
Viaarxiv icon

VideoAesBench: Benchmarking the Video Aesthetics Perception Capabilities of Large Multimodal Models

Add code
Jan 29, 2026
Viaarxiv icon

Are Video Generation Models Geographically Fair? An Attraction-Centric Evaluation of Global Visual Knowledge

Add code
Jan 26, 2026
Viaarxiv icon

Beyond Rigid: Benchmarking Non-Rigid Video Editing

Add code
Jan 26, 2026
Viaarxiv icon