Picture for Junkang Wu

Junkang Wu

Aligning Multimodal LLM with Human Preference: A Survey

Add code
Mar 18, 2025
Viaarxiv icon

RePO: ReLU-based Preference Optimization

Add code
Mar 10, 2025
Viaarxiv icon

DAMO: Data- and Model-aware Alignment of Multi-modal LLMs

Add code
Feb 04, 2025
Viaarxiv icon

$α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

Add code
Oct 14, 2024
Figure 1 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 2 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 3 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 4 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Viaarxiv icon

$β$-DPO: Direct Preference Optimization with Dynamic $β$

Add code
Jul 11, 2024
Viaarxiv icon

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Add code
Jul 10, 2024
Figure 1 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 2 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 3 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 4 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Viaarxiv icon

Direct Multi-Turn Preference Optimization for Language Agents

Add code
Jun 25, 2024
Viaarxiv icon

Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation

Add code
Feb 29, 2024
Viaarxiv icon

BSL: Understanding and Improving Softmax Loss for Recommendation

Add code
Dec 20, 2023
Viaarxiv icon

Understanding Contrastive Learning via Distributionally Robust Optimization

Add code
Oct 17, 2023
Figure 1 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 2 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 3 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 4 for Understanding Contrastive Learning via Distributionally Robust Optimization
Viaarxiv icon