Picture for Ni Mu

Ni Mu

SC2Arena and StarEvolve: Benchmark and Self-Improvement Framework for LLMs in Complex Decision-Making Tasks

Add code
Aug 14, 2025
Viaarxiv icon

Preference-based Multi-Objective Reinforcement Learning

Add code
Jul 18, 2025
Viaarxiv icon

S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning

Add code
Aug 22, 2024
Figure 1 for S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning
Figure 2 for S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning
Figure 3 for S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning
Figure 4 for S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning
Viaarxiv icon

E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance

Add code
Dec 05, 2022
Viaarxiv icon