Picture for Aarti Singh

Aarti Singh

Carnegie Mellon University

Expanding the Capabilities of Reinforcement Learning via Text Feedback

Add code
Feb 02, 2026
Viaarxiv icon

Online Social Welfare Function-based Resource Allocation

Add code
Feb 01, 2026
Viaarxiv icon

Cooperative Multi-agent RL with Communication Constraints

Add code
Jan 18, 2026
Viaarxiv icon

Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization

Add code
Nov 10, 2025
Viaarxiv icon

OEUVRE: OnlinE Unbiased Variance-Reduced loss Estimation

Add code
Oct 26, 2025
Viaarxiv icon

Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF

Add code
Feb 24, 2025
Viaarxiv icon

Optimistic Algorithms for Adaptive Estimation of the Average Treatment Effect

Add code
Feb 07, 2025
Viaarxiv icon

Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect

Add code
Nov 21, 2024
Viaarxiv icon

Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts

Add code
Sep 02, 2024
Figure 1 for Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts
Figure 2 for Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts
Figure 3 for Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts
Figure 4 for Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts
Viaarxiv icon

Hybrid Reinforcement Learning from Offline Observation Alone

Add code
Jun 11, 2024
Figure 1 for Hybrid Reinforcement Learning from Offline Observation Alone
Figure 2 for Hybrid Reinforcement Learning from Offline Observation Alone
Figure 3 for Hybrid Reinforcement Learning from Offline Observation Alone
Figure 4 for Hybrid Reinforcement Learning from Offline Observation Alone
Viaarxiv icon