Picture for Matthew M. Botvinick

Matthew M. Botvinick

How should the advent of large language models affect the practice of science?

Add code
Dec 05, 2023
Viaarxiv icon

Fine-tuning language models to find agreement among humans with diverse preferences

Add code
Nov 28, 2022
Figure 1 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 2 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 3 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 4 for Fine-tuning language models to find agreement among humans with diverse preferences
Viaarxiv icon

Adaptive patch foraging in deep reinforcement learning agents

Add code
Oct 14, 2022
Figure 1 for Adaptive patch foraging in deep reinforcement learning agents
Figure 2 for Adaptive patch foraging in deep reinforcement learning agents
Figure 3 for Adaptive patch foraging in deep reinforcement learning agents
Figure 4 for Adaptive patch foraging in deep reinforcement learning agents
Viaarxiv icon

Minimum Description Length Control

Add code
Jul 24, 2022
Figure 1 for Minimum Description Length Control
Figure 2 for Minimum Description Length Control
Figure 3 for Minimum Description Length Control
Figure 4 for Minimum Description Length Control
Viaarxiv icon

The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

Add code
Mar 17, 2022
Figure 1 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 2 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 3 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 4 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Viaarxiv icon

Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

Add code
Dec 14, 2021
Figure 1 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 2 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 3 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 4 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Viaarxiv icon

Perceiver IO: A General Architecture for Structured Inputs & Outputs

Add code
Aug 02, 2021
Figure 1 for Perceiver IO: A General Architecture for Structured Inputs & Outputs
Figure 2 for Perceiver IO: A General Architecture for Structured Inputs & Outputs
Figure 3 for Perceiver IO: A General Architecture for Structured Inputs & Outputs
Figure 4 for Perceiver IO: A General Architecture for Structured Inputs & Outputs
Viaarxiv icon

Stabilizing Transformers for Reinforcement Learning

Add code
Oct 13, 2019
Figure 1 for Stabilizing Transformers for Reinforcement Learning
Figure 2 for Stabilizing Transformers for Reinforcement Learning
Figure 3 for Stabilizing Transformers for Reinforcement Learning
Figure 4 for Stabilizing Transformers for Reinforcement Learning
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Sep 26, 2019
Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon

Learned human-agent decision-making, communication and joint action in a virtual reality environment

Add code
May 07, 2019
Figure 1 for Learned human-agent decision-making, communication and joint action in a virtual reality environment
Figure 2 for Learned human-agent decision-making, communication and joint action in a virtual reality environment
Figure 3 for Learned human-agent decision-making, communication and joint action in a virtual reality environment
Viaarxiv icon