Yali Du

SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas

Mar 18, 2025
Via arXiv

GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models

Mar 12, 2025
Via arXiv

M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

Mar 06, 2025
Via arXiv

ATLaS: Agent Tuning via Learning Critical Steps

Mar 04, 2025
Via arXiv

$\text{M}^3\text{HF}$: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

Mar 03, 2025
Via arXiv

CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation

Feb 28, 2025
Via arXiv

Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?

Feb 26, 2025
Via arXiv

VLP: Vision-Language Preference Learning for Embodied Manipulation

Feb 17, 2025
Via arXiv

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors

Dec 14, 2024
Via arXiv

RuAG: Learned-rule-augmented Generation for Large Language Models

Nov 04, 2024
Via arXiv