Picture for Marwa Abdulhai

Marwa Abdulhai

Virtual Personas for Language Models via an Anthology of Backstories

Add code
Jul 09, 2024
Viaarxiv icon

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Add code
Nov 30, 2023
Figure 1 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 2 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 3 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 4 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Viaarxiv icon

Moral Foundations of Large Language Models

Add code
Oct 23, 2023
Viaarxiv icon

Personality Traits in Large Language Models

Add code
Jul 01, 2023
Viaarxiv icon

Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience

Add code
Aug 09, 2022
Figure 1 for Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Figure 2 for Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Figure 3 for Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Figure 4 for Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Viaarxiv icon

Context-Specific Representation Abstraction for Deep Option Learning

Add code
Sep 20, 2021
Figure 1 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 2 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 3 for Context-Specific Representation Abstraction for Deep Option Learning
Figure 4 for Context-Specific Representation Abstraction for Deep Option Learning
Viaarxiv icon

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Add code
Oct 31, 2020
Figure 1 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 2 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 3 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Figure 4 for A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Viaarxiv icon