Haoning Wu

Image Quality Assessment: From Human to Machine Preference

Mar 13, 2025

Generative Frame Sampler for Long Video Understanding

Mar 12, 2025

Teaching LMMs for Image Quality Scoring and Interpreting

Mar 12, 2025

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Mar 10, 2025

MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Dec 04, 2024

Towards Universal Soccer Video Understanding

Dec 02, 2024

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Nov 20, 2024

VQA$^2$: Visual Question Answering for Video Quality Assessment

Nov 06, 2024

Aria: An Open Multimodal Native Mixture-of-Experts Model

Oct 08, 2024

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Oct 07, 2024