Picture for Sergey Levine

Sergey Levine

Stanford University

Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review

Add code
Jan 16, 2025
Viaarxiv icon

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Add code
Jan 16, 2025
Viaarxiv icon

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

Add code
Jan 08, 2025
Viaarxiv icon

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Add code
Dec 17, 2024
Viaarxiv icon

RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning

Add code
Dec 13, 2024
Viaarxiv icon

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Add code
Dec 10, 2024
Viaarxiv icon

Predicting Emergent Capabilities by Finetuning

Add code
Nov 25, 2024
Figure 1 for Predicting Emergent Capabilities by Finetuning
Figure 2 for Predicting Emergent Capabilities by Finetuning
Figure 3 for Predicting Emergent Capabilities by Finetuning
Figure 4 for Predicting Emergent Capabilities by Finetuning
Viaarxiv icon

What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?

Add code
Nov 12, 2024
Figure 1 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 2 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 3 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 4 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Viaarxiv icon

Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations

Add code
Nov 07, 2024
Figure 1 for Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
Figure 2 for Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
Figure 3 for Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
Figure 4 for Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
Viaarxiv icon

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning

Add code
Nov 07, 2024
Figure 1 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 2 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 3 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 4 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Viaarxiv icon