Picture for Baoxiong Jia

Baoxiong Jia

Multi-modal Situated Reasoning in 3D Scenes

Add code
Sep 04, 2024
Viaarxiv icon

PhysPart: Physically Plausible Part Completion for Interactable Objects

Add code
Aug 25, 2024
Viaarxiv icon

SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

Add code
Aug 13, 2024
Viaarxiv icon

Task-oriented Sequential Grounding in 3D Scenes

Add code
Aug 07, 2024
Figure 1 for Task-oriented Sequential Grounding in 3D Scenes
Figure 2 for Task-oriented Sequential Grounding in 3D Scenes
Figure 3 for Task-oriented Sequential Grounding in 3D Scenes
Figure 4 for Task-oriented Sequential Grounding in 3D Scenes
Viaarxiv icon

Unifying 3D Vision-Language Understanding via Promptable Queries

Add code
May 19, 2024
Figure 1 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 2 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 3 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 4 for Unifying 3D Vision-Language Understanding via Promptable Queries
Viaarxiv icon

Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

Add code
Apr 16, 2024
Viaarxiv icon

PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI

Add code
Apr 15, 2024
Viaarxiv icon

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

Add code
Mar 26, 2024
Viaarxiv icon

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Add code
Jan 17, 2024
Viaarxiv icon

An Embodied Generalist Agent in 3D World

Add code
Nov 18, 2023
Viaarxiv icon