Nicolas Heess

Informatics

Preference Optimization as Probabilistic Inference

Oct 05, 2024
Via arXiv

DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

Sep 10, 2024
Via arXiv

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

Jun 04, 2024
Via arXiv

Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice

May 19, 2024
Via arXiv

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

May 03, 2024
Via arXiv

The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

Apr 04, 2024
Via arXiv

Genie: Generative Interactive Environments

Feb 23, 2024
Via arXiv

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024
Via arXiv

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Feb 12, 2024
Via arXiv

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Feb 08, 2024
Via arXiv