Tongzhou Mu

Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model

Dec 18, 2024
Via arXiv

When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?

Dec 18, 2024
Via arXiv

ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI

Oct 01, 2024
Via arXiv

DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks

Apr 25, 2024
Via arXiv

AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent

Apr 11, 2024
Via arXiv

Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving

Nov 01, 2023
Via arXiv

Accelerated Doubly Stochastic Gradient Algorithm for Large-scale Empirical Risk Minimization

Apr 23, 2023
Via arXiv

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Mar 27, 2023
Via arXiv

ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills

Feb 09, 2023
Via arXiv

On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

Dec 12, 2022
Via arXiv