Laixi Shi

Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization

Feb 11, 2026

Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity

Feb 03, 2026

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

May 30, 2025

KL-regularization Itself is Differentially Private in Bandits and RLHF

May 23, 2025

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Feb 27, 2025

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data

Nov 06, 2024

Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning?

Sep 30, 2024

BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning

Jul 15, 2024

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

Jun 22, 2024

Tractable Equilibrium Computation in Markov Games through Risk Aversion

Jun 20, 2024