Picture for Max Simchowitz

Max Simchowitz

Self-Improvement in Language Models: The Sharpening Mechanism

Add code
Dec 02, 2024
Viaarxiv icon

Is Linear Feedback on Smoothed Dynamics Sufficient for Stabilizing Contact-Rich Plans?

Add code
Nov 14, 2024
Viaarxiv icon

Faster Algorithms for Growing Collision-Free Convex Polytopes in Robot Configuration Space

Add code
Oct 16, 2024
Viaarxiv icon

Diffusion Policy Policy Optimization

Add code
Sep 01, 2024
Figure 1 for Diffusion Policy Policy Optimization
Figure 2 for Diffusion Policy Policy Optimization
Figure 3 for Diffusion Policy Policy Optimization
Figure 4 for Diffusion Policy Policy Optimization
Viaarxiv icon

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Add code
Jul 02, 2024
Viaarxiv icon

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Add code
Oct 17, 2023
Viaarxiv icon

Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Add code
Oct 02, 2023
Figure 1 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use
Figure 2 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use
Figure 3 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use
Figure 4 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use
Viaarxiv icon

Constrained Bimanual Planning with Analytic Inverse Kinematics

Add code
Sep 15, 2023
Viaarxiv icon

RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability

Add code
Aug 31, 2023
Viaarxiv icon

Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior

Add code
Jul 29, 2023
Figure 1 for Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior
Figure 2 for Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior
Figure 3 for Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior
Figure 4 for Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior
Viaarxiv icon