Picture for Adrien Ecoffet

Adrien Ecoffet

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Dec 14, 2023
Figure 1 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 2 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 3 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 4 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Viaarxiv icon

Video PreTraining : Learning to Act by Watching Unlabeled Online Videos

Add code
Jun 23, 2022
Figure 1 for Video PreTraining : Learning to Act by Watching Unlabeled Online Videos
Figure 2 for Video PreTraining : Learning to Act by Watching Unlabeled Online Videos
Figure 3 for Video PreTraining : Learning to Act by Watching Unlabeled Online Videos
Figure 4 for Video PreTraining : Learning to Act by Watching Unlabeled Online Videos
Viaarxiv icon

Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

Add code
Jun 28, 2021
Figure 1 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 2 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 3 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 4 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Viaarxiv icon

Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity

Add code
Jun 12, 2020
Viaarxiv icon

Reinforcement Learning Under Moral Uncertainty

Add code
Jun 08, 2020
Figure 1 for Reinforcement Learning Under Moral Uncertainty
Figure 2 for Reinforcement Learning Under Moral Uncertainty
Figure 3 for Reinforcement Learning Under Moral Uncertainty
Figure 4 for Reinforcement Learning Under Moral Uncertainty
Viaarxiv icon

First return then explore

Add code
May 14, 2020
Figure 1 for First return then explore
Figure 2 for First return then explore
Figure 3 for First return then explore
Figure 4 for First return then explore
Viaarxiv icon

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Add code
Feb 21, 2020
Figure 1 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 2 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 3 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 4 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Viaarxiv icon

Exploration Based Language Learning for Text-Based Games

Add code
Jan 24, 2020
Figure 1 for Exploration Based Language Learning for Text-Based Games
Figure 2 for Exploration Based Language Learning for Text-Based Games
Figure 3 for Exploration Based Language Learning for Text-Based Games
Figure 4 for Exploration Based Language Learning for Text-Based Games
Viaarxiv icon

Go-Explore: a New Approach for Hard-Exploration Problems

Add code
Jan 30, 2019
Figure 1 for Go-Explore: a New Approach for Hard-Exploration Problems
Figure 2 for Go-Explore: a New Approach for Hard-Exploration Problems
Figure 3 for Go-Explore: a New Approach for Hard-Exploration Problems
Figure 4 for Go-Explore: a New Approach for Hard-Exploration Problems
Viaarxiv icon