Adam Wierman

Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity

Feb 03, 2026
Via arXiv

SCaLE: Switching Cost aware Learning and Exploration

Jan 14, 2026
Via arXiv

Fairness-Regularized Online Optimization with Switching Costs

Dec 11, 2025
Via arXiv

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees

May 25, 2025
Via arXiv

KL-regularization Itself is Differentially Private in Bandits and RLHF

May 23, 2025
Via arXiv

Fusing Reward and Dueling Feedback in Stochastic Bandits

Apr 22, 2025
Via arXiv

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Feb 27, 2025
Via arXiv

Towards Environmentally Equitable AI

Dec 21, 2024
Via arXiv

Communication Efficient Decentralization for Smoothed Online Convex Optimization

Nov 13, 2024
Via arXiv

Safe Exploitative Play with Untrusted Type Beliefs

Nov 12, 2024
Via arXiv