Picture for Jia-Bin Huang

Jia-Bin Huang

Guava: An Effective and Universal Harness for Embodied Manipulation

Add code
Jun 16, 2026
Viaarxiv icon

$μ_0$: A Scalable 3D Interaction-Trace World Model

Add code
Jun 11, 2026
Viaarxiv icon

UniVerse: A Unified Modulation Framework for Segmentation-Free,Disentangled Multi-Concept Personalization

Add code
May 29, 2026
Viaarxiv icon

DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation

Add code
May 28, 2026
Viaarxiv icon

TRACE: Object Motion Editing in Videos with First-Frame Trajectory Guidance

Add code
Mar 26, 2026
Viaarxiv icon

Generative Refocusing: Flexible Defocus Control from a Single Image

Add code
Dec 18, 2025
Figure 1 for Generative Refocusing: Flexible Defocus Control from a Single Image
Figure 2 for Generative Refocusing: Flexible Defocus Control from a Single Image
Figure 3 for Generative Refocusing: Flexible Defocus Control from a Single Image
Figure 4 for Generative Refocusing: Flexible Defocus Control from a Single Image
Viaarxiv icon

Coupled Diffusion Sampling for Training-Free Multi-View Image Editing

Add code
Oct 16, 2025
Viaarxiv icon

Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Add code
Aug 06, 2025
Viaarxiv icon

Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models

Add code
May 12, 2025
Figure 1 for Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Figure 2 for Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Figure 3 for Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Figure 4 for Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Viaarxiv icon

LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields

Add code
Apr 28, 2025
Viaarxiv icon