Picture for Yilun Du

Yilun Du

Derek

Inference-Time Policy Steering through Human Interactions

Add code
Nov 25, 2024
Viaarxiv icon

SnapMem: Snapshot-based 3D Scene Memory for Embodied Exploration and Reasoning

Add code
Nov 23, 2024
Viaarxiv icon

Grounding Video Models to Actions through Goal Conditioned Exploration

Add code
Nov 11, 2024
Viaarxiv icon

Few-Shot Task Learning through Inverse Generative Modeling

Add code
Nov 07, 2024
Viaarxiv icon

Compositional Diffusion Models for Powered Descent Trajectory Generation with Flexible Constraints

Add code
Oct 05, 2024
Viaarxiv icon

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Add code
Oct 04, 2024
Viaarxiv icon

Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations

Add code
Aug 08, 2024
Figure 1 for Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations
Figure 2 for Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations
Figure 3 for Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations
Figure 4 for Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations
Viaarxiv icon

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

Add code
Jul 16, 2024
Figure 1 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 2 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 3 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 4 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Viaarxiv icon

Potential Based Diffusion Motion Planning

Add code
Jul 08, 2024
Viaarxiv icon

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Add code
Jul 02, 2024
Viaarxiv icon