Picture for Guangtao Zhai

Guangtao Zhai

Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model

Add code
Apr 09, 2025
Viaarxiv icon

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Add code
Apr 03, 2025
Viaarxiv icon

Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes

Add code
Apr 02, 2025
Viaarxiv icon

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy

Add code
Mar 27, 2025
Viaarxiv icon

Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing

Add code
Mar 25, 2025
Viaarxiv icon

Variational Bayesian Personalized Ranking

Add code
Mar 14, 2025
Viaarxiv icon

Information Density Principle for MLLM Benchmarks

Add code
Mar 13, 2025
Viaarxiv icon

Image Quality Assessment: From Human to Machine Preference

Add code
Mar 13, 2025
Viaarxiv icon

Teaching LMMs for Image Quality Scoring and Interpreting

Add code
Mar 12, 2025
Viaarxiv icon

Towards All-in-One Medical Image Re-Identification

Add code
Mar 11, 2025
Viaarxiv icon