Picture for Matthew Riemer

Matthew Riemer

Can Large Language Models Adapt to Other Agents In-Context?

Add code
Dec 27, 2024
Viaarxiv icon

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Add code
Dec 18, 2024
Figure 1 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 2 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 3 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 4 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Viaarxiv icon

Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs

Add code
Nov 11, 2024
Figure 1 for Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs
Figure 2 for Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs
Figure 3 for Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs
Figure 4 for Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs
Viaarxiv icon

Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria

Add code
Oct 28, 2022
Viaarxiv icon

Influencing Long-Term Behavior in Multiagent Reinforcement Learning

Add code
Mar 07, 2022
Figure 1 for Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Figure 2 for Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Figure 3 for Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Figure 4 for Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Viaarxiv icon

Continual Learning In Environments With Polynomial Mixing Times

Add code
Dec 13, 2021
Figure 1 for Continual Learning In Environments With Polynomial Mixing Times
Figure 2 for Continual Learning In Environments With Polynomial Mixing Times
Figure 3 for Continual Learning In Environments With Polynomial Mixing Times
Figure 4 for Continual Learning In Environments With Polynomial Mixing Times
Viaarxiv icon

Context-Specific Representation Abstraction for Deep Option Learning

Add code
Sep 20, 2021
Figure 1 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 2 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 3 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 4 for Context-Specific Representation Abstraction for Deep Option Learning
Viaarxiv icon

Towards Continual Reinforcement Learning: A Review and Perspectives

Add code
Dec 25, 2020
Figure 1 for Towards Continual Reinforcement Learning: A Review and Perspectives
Figure 2 for Towards Continual Reinforcement Learning: A Review and Perspectives
Figure 3 for Towards Continual Reinforcement Learning: A Review and Perspectives
Figure 4 for Towards Continual Reinforcement Learning: A Review and Perspectives
Viaarxiv icon

Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games

Add code
Nov 23, 2020
Figure 1 for Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games
Figure 2 for Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games
Figure 3 for Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games
Figure 4 for Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games
Viaarxiv icon

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Add code
Oct 31, 2020
Figure 1 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 2 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 3 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 4 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Viaarxiv icon