Picture for Martin J. Chadwick

Martin J. Chadwick

Fine-tuning language models to find agreement among humans with diverse preferences

Add code
Nov 28, 2022
Figure 1 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 2 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 3 for Fine-tuning language models to find agreement among humans with diverse preferences
Figure 4 for Fine-tuning language models to find agreement among humans with diverse preferences
Viaarxiv icon

Deep reinforcement learning models the emergent dynamics of human cooperation

Add code
Mar 08, 2021
Figure 1 for Deep reinforcement learning models the emergent dynamics of human cooperation
Figure 2 for Deep reinforcement learning models the emergent dynamics of human cooperation
Figure 3 for Deep reinforcement learning models the emergent dynamics of human cooperation
Viaarxiv icon

MEMO: A Deep Network for Flexible Combination of Episodic Memories

Add code
Jan 29, 2020
Figure 1 for MEMO: A Deep Network for Flexible Combination of Episodic Memories
Figure 2 for MEMO: A Deep Network for Flexible Combination of Episodic Memories
Figure 3 for MEMO: A Deep Network for Flexible Combination of Episodic Memories
Figure 4 for MEMO: A Deep Network for Flexible Combination of Episodic Memories
Viaarxiv icon