Picture for Yun He

Yun He

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Add code
Jan 31, 2025
Viaarxiv icon

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Add code
Jan 18, 2025
Viaarxiv icon

Unifying Generative and Dense Retrieval for Sequential Recommendation

Add code
Nov 27, 2024
Viaarxiv icon

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Add code
Oct 21, 2024
Viaarxiv icon

The Perfect Blend: Redefining RLHF with Mixture of Judges

Add code
Sep 30, 2024
Figure 1 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 2 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 3 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 4 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Viaarxiv icon

PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts

Add code
Jun 07, 2023
Figure 1 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 2 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 3 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 4 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Viaarxiv icon

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

Add code
Apr 24, 2023
Viaarxiv icon

Evolution of Popularity Bias: Empirical Study and Debiasing

Add code
Jul 07, 2022
Figure 1 for Evolution of Popularity Bias: Empirical Study and Debiasing
Figure 2 for Evolution of Popularity Bias: Empirical Study and Debiasing
Figure 3 for Evolution of Popularity Bias: Empirical Study and Debiasing
Figure 4 for Evolution of Popularity Bias: Empirical Study and Debiasing
Viaarxiv icon

Density-preserving Deep Point Cloud Compression

Add code
Apr 27, 2022
Figure 1 for Density-preserving Deep Point Cloud Compression
Figure 2 for Density-preserving Deep Point Cloud Compression
Figure 3 for Density-preserving Deep Point Cloud Compression
Figure 4 for Density-preserving Deep Point Cloud Compression
Viaarxiv icon

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

Add code
Mar 14, 2022
Figure 1 for MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks
Figure 2 for MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks
Figure 3 for MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks
Figure 4 for MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks
Viaarxiv icon