Zihan Ding

Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention

Nov 17, 2025

Single-stream Policy Optimization

Sep 16, 2025

ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes

Jun 17, 2025

Learning World Models for Interactive Video Generation

May 28, 2025

Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning

Mar 04, 2025

Generative Diffusion Modeling: A Practical Handbook

Dec 22, 2024

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization

Dec 20, 2024

TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation

Nov 25, 2024

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Sep 12, 2024

How to beat a Bayesian adversary

Jul 11, 2024