Picture for Haoning Wu

Haoning Wu

Kimi-VL Technical Report

Add code
Apr 10, 2025
Viaarxiv icon

Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning

Add code
Apr 02, 2025
Viaarxiv icon

Image Quality Assessment: From Human to Machine Preference

Add code
Mar 13, 2025
Viaarxiv icon

Generative Frame Sampler for Long Video Understanding

Add code
Mar 12, 2025
Viaarxiv icon

Teaching LMMs for Image Quality Scoring and Interpreting

Add code
Mar 12, 2025
Viaarxiv icon

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Add code
Mar 10, 2025
Viaarxiv icon

MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Add code
Dec 04, 2024
Figure 1 for MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Figure 2 for MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Figure 3 for MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Figure 4 for MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Viaarxiv icon

Towards Universal Soccer Video Understanding

Add code
Dec 02, 2024
Viaarxiv icon

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Figure 1 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 2 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 3 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 4 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Viaarxiv icon

VQA$^2$:Visual Question Answering for Video Quality Assessment

Add code
Nov 06, 2024
Figure 1 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 2 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 3 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 4 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Viaarxiv icon