Tong Mu

Rule Based Rewards for Language Model Safety

Nov 02, 2024

Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning

Jun 14, 2023

MC-MLP: Multiple Coordinate Frames in all-MLP Architecture for Vision

Apr 08, 2023

Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning

Jan 18, 2022

Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning

Dec 30, 2021

PLOTS: Procedure Learning from Observations using Subtask Structure

Apr 17, 2019