Weikai Chen

DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives

Mar 17, 2026
Via arXiv

SAP: Segment Any 4K Panorama

Mar 13, 2026
Via arXiv

CoSMo3D: Open-World Promptable 3D Semantic Part Segmentation through LLM-Guided Canonical Spatial Modeling

Mar 01, 2026
Via arXiv

MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns

Nov 16, 2025
Via arXiv

AvatarTex: High-Fidelity Facial Texture Reconstruction from Single-Image Stylized Avatars

Nov 10, 2025
Via arXiv

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving

Nov 09, 2025
Via arXiv

SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation

Sep 16, 2025
Via arXiv

GarmentX: Autoregressive Parametric Representations for High-Fidelity 3D Garment Generation

Apr 29, 2025
Via arXiv

TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation

Mar 14, 2025
Via arXiv

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

Dec 12, 2024
Via arXiv