Ping Tan

Simon Fraser University

GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor

Jan 17, 2025
Via arXiv

Universal Features Guided Zero-Shot Category-Level Object Pose Estimation

Jan 06, 2025
Via arXiv

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

Dec 30, 2024
Via arXiv

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

Dec 27, 2024
Via arXiv

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Dec 24, 2024
Via arXiv

Multi-GraspLLM: A Multimodal LLM for Multi-Hand Semantic Guided Grasp Generation

Dec 11, 2024
Via arXiv

World-Consistent Data Generation for Vision-and-Language Navigation

Dec 09, 2024
Via arXiv

Dual Prototyping with Domain and Class Prototypes for Affective Brain-Computer Interface in Unseen Target Conditions

Nov 27, 2024
Via arXiv

MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework

Nov 26, 2024
Via arXiv

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Nov 26, 2024
Via arXiv