Picture for Haoning Wu

Haoning Wu

BabyVision: Visual Reasoning Beyond Language

Add code
Jan 10, 2026
Viaarxiv icon

SoccerMaster: A Vision Foundation Model for Soccer Understanding

Add code
Dec 11, 2025
Viaarxiv icon

VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results

Add code
Sep 11, 2025
Figure 1 for VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
Figure 2 for VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
Figure 3 for VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
Figure 4 for VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
Viaarxiv icon

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Add code
May 29, 2025
Viaarxiv icon

Scaling-up Perceptual Video Quality Assessment

Add code
May 28, 2025
Viaarxiv icon

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

Add code
May 22, 2025
Viaarxiv icon

Multi-Agent System for Comprehensive Soccer Understanding

Add code
May 06, 2025
Viaarxiv icon

Kimi-VL Technical Report

Add code
Apr 10, 2025
Figure 1 for Kimi-VL Technical Report
Figure 2 for Kimi-VL Technical Report
Figure 3 for Kimi-VL Technical Report
Figure 4 for Kimi-VL Technical Report
Viaarxiv icon

Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning

Add code
Apr 02, 2025
Viaarxiv icon

Image Quality Assessment: From Human to Machine Preference

Add code
Mar 13, 2025
Viaarxiv icon