Kang Rong

SAIL: Self-Amplified Iterative Learning for Diffusion Model Alignment with Minimal Human Feedback

Feb 05, 2026
Via arXiv

WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning

Jun 09, 2025
Via arXiv

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Mar 26, 2025
Via arXiv

Automated Multi-level Preference for MLLMs

May 18, 2024
Via arXiv

HARIS: Human-Like Attention for Reference Image Segmentation

May 17, 2024
Via arXiv