Alec Koppel

Approximate Equivariance in Reinforcement Learning

Nov 06, 2024

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Oct 10, 2024

Partially Observable Contextual Bandits with Linear Payoffs

Sep 17, 2024

SAIL: Self-Improving Efficient Online Alignment of Large Language Models

Jun 21, 2024

Compressed Online Learning of Conditional Mean Embedding

May 13, 2024

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

Mar 18, 2024

Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective

Mar 17, 2024

Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

Mar 13, 2024

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Feb 14, 2024

Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach

Nov 18, 2023