Picture for Brandon Cui

Brandon Cui

Critique-out-Loud Reward Models

Add code
Aug 21, 2024
Viaarxiv icon

K-level Reasoning for Zero-Shot Coordination in Hanabi

Add code
Jul 14, 2022
Figure 1 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 2 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 3 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 4 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Viaarxiv icon

CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research

Add code
Sep 17, 2021
Figure 1 for CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Figure 2 for CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Figure 3 for CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Figure 4 for CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Viaarxiv icon

Learning Space Partitions for Path Planning

Add code
Jul 14, 2021
Figure 1 for Learning Space Partitions for Path Planning
Figure 2 for Learning Space Partitions for Path Planning
Figure 3 for Learning Space Partitions for Path Planning
Figure 4 for Learning Space Partitions for Path Planning
Viaarxiv icon

Off-Belief Learning

Add code
Mar 06, 2021
Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon

Variational Model-based Policy Optimization

Add code
Jun 24, 2020
Figure 1 for Variational Model-based Policy Optimization
Figure 2 for Variational Model-based Policy Optimization
Figure 3 for Variational Model-based Policy Optimization
Figure 4 for Variational Model-based Policy Optimization
Viaarxiv icon

Control-Aware Representations for Model-based Reinforcement Learning

Add code
Jun 24, 2020
Figure 1 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 2 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 3 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 4 for Control-Aware Representations for Model-based Reinforcement Learning
Viaarxiv icon