Picture for Haodong Duan

Haodong Duan

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Add code
Mar 19, 2025
Viaarxiv icon

Image Quality Assessment: From Human to Machine Preference

Add code
Mar 13, 2025
Viaarxiv icon

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Add code
Mar 13, 2025
Viaarxiv icon

Information Density Principle for MLLM Benchmarks

Add code
Mar 13, 2025
Viaarxiv icon

Visual-RFT: Visual Reinforcement Fine-Tuning

Add code
Mar 03, 2025
Viaarxiv icon

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Add code
Feb 25, 2025
Viaarxiv icon

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Add code
Feb 07, 2025
Viaarxiv icon

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Add code
Jan 21, 2025
Figure 1 for InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Figure 2 for InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Figure 3 for InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Figure 4 for InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Viaarxiv icon

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Add code
Jan 21, 2025
Figure 1 for Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
Figure 2 for Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
Figure 3 for Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
Figure 4 for Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
Viaarxiv icon

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Add code
Jan 09, 2025
Viaarxiv icon