Nicolas Heess

Informatics

Preference Optimization as Probabilistic Inference

Oct 05, 2024

DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

Sep 10, 2024

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

Jun 04, 2024

Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice

May 19, 2024

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

May 03, 2024

The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

Apr 04, 2024

Genie: Generative Interactive Environments

Feb 23, 2024

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Feb 12, 2024

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Feb 08, 2024