Tongtong Cao

Efficient Camera Pose Augmentation for View Generalization in Robotic Policy Learning

Mar 31, 2026

Do World Action Models Generalize Better than VLAs? A Robustness Study

Mar 23, 2026

H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model

Feb 11, 2026

Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation

Feb 20, 2025

SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning

Jan 17, 2025

UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations

Nov 22, 2024

3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications

Oct 14, 2024

AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction

Jul 02, 2024

Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge

Sep 28, 2023

GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds

Aug 16, 2023