Picture for Junkang Wu

Junkang Wu

$α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

Add code
Oct 14, 2024
Figure 1 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 2 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 3 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Figure 4 for $α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs
Viaarxiv icon

$β$-DPO: Direct Preference Optimization with Dynamic $β$

Add code
Jul 11, 2024
Viaarxiv icon

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Add code
Jul 10, 2024
Figure 1 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 2 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 3 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Figure 4 for Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Viaarxiv icon

Direct Multi-Turn Preference Optimization for Language Agents

Add code
Jun 25, 2024
Viaarxiv icon

Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation

Add code
Feb 29, 2024
Viaarxiv icon

BSL: Understanding and Improving Softmax Loss for Recommendation

Add code
Dec 20, 2023
Viaarxiv icon

Understanding Contrastive Learning via Distributionally Robust Optimization

Add code
Oct 17, 2023
Figure 1 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 2 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 3 for Understanding Contrastive Learning via Distributionally Robust Optimization
Figure 4 for Understanding Contrastive Learning via Distributionally Robust Optimization
Viaarxiv icon

On the Theories Behind Hard Negative Sampling for Recommendation

Add code
Feb 19, 2023
Figure 1 for On the Theories Behind Hard Negative Sampling for Recommendation
Figure 2 for On the Theories Behind Hard Negative Sampling for Recommendation
Figure 3 for On the Theories Behind Hard Negative Sampling for Recommendation
Figure 4 for On the Theories Behind Hard Negative Sampling for Recommendation
Viaarxiv icon

Adap-tau: Adaptively Modulating Embedding Magnitude for Recommendation

Add code
Feb 09, 2023
Viaarxiv icon

FFHR: Fully and Flexible Hyperbolic Representation for Knowledge Graph Completion

Add code
Feb 07, 2023
Viaarxiv icon