Picture for Sergey Levine

Sergey Levine

Octo: An Open-Source Generalist Robot Policy

Add code
May 20, 2024
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Viaarxiv icon

Evaluating Real-World Robot Manipulation Policies in Simulation

Add code
May 09, 2024
Viaarxiv icon

RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes

Add code
May 07, 2024
Viaarxiv icon

Learning Visuotactile Skills with Two Multifingered Hands

Add code
Apr 25, 2024
Viaarxiv icon

Autonomous Evaluation and Refinement of Digital Agents

Add code
Apr 10, 2024
Viaarxiv icon

Yell At Your Robot: Improving On-the-Fly from Language Corrections

Add code
Mar 19, 2024
Figure 1 for Yell At Your Robot: Improving On-the-Fly from Language Corrections
Figure 2 for Yell At Your Robot: Improving On-the-Fly from Language Corrections
Figure 3 for Yell At Your Robot: Improving On-the-Fly from Language Corrections
Figure 4 for Yell At Your Robot: Improving On-the-Fly from Language Corrections
Viaarxiv icon

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

Add code
Mar 19, 2024
Figure 1 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 2 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 3 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 4 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Viaarxiv icon

Unfamiliar Finetuning Examples Control How Language Models Hallucinate

Add code
Mar 08, 2024
Figure 1 for Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Figure 2 for Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Figure 3 for Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Figure 4 for Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Viaarxiv icon

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Add code
Mar 06, 2024
Figure 1 for Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Figure 2 for Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Figure 3 for Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Figure 4 for Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Viaarxiv icon