Picture for Alec Koppel

Alec Koppel

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Add code
Mar 27, 2025
Viaarxiv icon

Efficient Inverse Multiagent Learning

Add code
Feb 20, 2025
Viaarxiv icon

Nonparametric Sparse Online Learning of the Koopman Operator

Add code
Jan 27, 2025
Viaarxiv icon

Regularized Proportional Fairness Mechanism for Resource Allocation Without Money

Add code
Jan 02, 2025
Figure 1 for Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Figure 2 for Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Figure 3 for Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Figure 4 for Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Viaarxiv icon

Approximate Equivariance in Reinforcement Learning

Add code
Nov 06, 2024
Viaarxiv icon

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Add code
Oct 10, 2024
Figure 1 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 2 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 3 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 4 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Viaarxiv icon

Partially Observable Contextual Bandits with Linear Payoffs

Add code
Sep 17, 2024
Viaarxiv icon

SAIL: Self-Improving Efficient Online Alignment of Large Language Models

Add code
Jun 21, 2024
Viaarxiv icon

Compressed Online Learning of Conditional Mean Embedding

Add code
May 13, 2024
Viaarxiv icon

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

Add code
Mar 18, 2024
Viaarxiv icon