Picture for Guangtao Zhai

Guangtao Zhai

VQA$^2$:Visual Question Answering for Video Quality Assessment

Add code
Nov 06, 2024
Viaarxiv icon

On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection

Add code
Oct 31, 2024
Viaarxiv icon

ResAD: A Simple Framework for Class Generalizable Anomaly Detection

Add code
Oct 26, 2024
Viaarxiv icon

MMHead: Towards Fine-grained Multi-modal 3D Facial Animation

Add code
Oct 10, 2024
Figure 1 for MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Figure 2 for MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Figure 3 for MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Figure 4 for MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Viaarxiv icon

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Add code
Oct 07, 2024
Viaarxiv icon

AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results

Add code
Oct 05, 2024
Figure 1 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 2 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 3 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 4 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Viaarxiv icon

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Add code
Sep 30, 2024
Viaarxiv icon

Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming

Add code
Sep 26, 2024
Viaarxiv icon

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Add code
Sep 18, 2024
Figure 1 for Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression
Figure 2 for Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression
Figure 3 for Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression
Figure 4 for Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression
Viaarxiv icon

Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

Add code
Sep 17, 2024
Viaarxiv icon