Picture for Haonan Chang

Haonan Chang

Motion Blender Gaussian Splatting for Dynamic Reconstruction

Add code
Mar 12, 2025
Viaarxiv icon

Autoregressive Action Sequence Learning for Robotic Manipulation

Add code
Oct 04, 2024
Viaarxiv icon

UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models

Add code
Sep 30, 2024
Figure 1 for UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Figure 2 for UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Figure 3 for UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Figure 4 for UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Viaarxiv icon

DAP: Diffusion-based Affordance Prediction for Multi-modality Storage

Add code
Aug 31, 2024
Viaarxiv icon

Scaling Manipulation Learning with Visual Kinematic Chain Prediction

Add code
Jun 12, 2024
Figure 1 for Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Figure 2 for Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Figure 3 for Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Figure 4 for Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Viaarxiv icon

A3VLM: Actionable Articulation-Aware Vision Language Model

Add code
Jun 11, 2024
Viaarxiv icon

OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data

Add code
Nov 06, 2023
Viaarxiv icon

LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

Add code
Sep 27, 2023
Figure 1 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 2 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 3 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 4 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Viaarxiv icon

Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

Add code
Sep 27, 2023
Figure 1 for Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Figure 2 for Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Figure 3 for Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Figure 4 for Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Viaarxiv icon

Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction

Add code
Jan 30, 2023
Figure 1 for Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction
Figure 2 for Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction
Figure 3 for Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction
Figure 4 for Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction
Viaarxiv icon