Haoning Wu

MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Dec 04, 2024

Towards Universal Soccer Video Understanding

Dec 02, 2024

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Nov 20, 2024

VQA$^2$: Visual Question Answering for Video Quality Assessment

Nov 06, 2024

Aria: An Open Multimodal Native Mixture-of-Experts Model

Oct 08, 2024

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Oct 07, 2024

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Sep 30, 2024

Explore the Hallucination on Low-level Perception for MLLMs

Sep 15, 2024

LIME-M: Less Is More for Evaluation of MLLMs

Sep 10, 2024

MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning

Aug 20, 2024