Picture for Yunze Liu

Yunze Liu

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos

Add code
Mar 16, 2025
Viaarxiv icon

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining

Add code
Mar 16, 2025
Viaarxiv icon

MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data

Add code
Jan 08, 2025
Figure 1 for MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Figure 2 for MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Figure 3 for MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Figure 4 for MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Viaarxiv icon

MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining

Add code
Oct 01, 2024
Viaarxiv icon

Physics-aware Hand-object Interaction Denoising

Add code
May 19, 2024
Viaarxiv icon

PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation

Add code
Apr 01, 2024
Figure 1 for PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Figure 2 for PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Figure 3 for PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Figure 4 for PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Viaarxiv icon

CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding

Add code
Jan 17, 2024
Viaarxiv icon

Interactive Humanoid: Online Full-Body Motion Reaction Synthesis with Social Affordance Canonicalization and Forecasting

Add code
Dec 30, 2023
Viaarxiv icon

NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding

Add code
Oct 12, 2023
Viaarxiv icon

Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning

Add code
Dec 20, 2022
Figure 1 for Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning
Figure 2 for Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning
Figure 3 for Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning
Figure 4 for Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning
Viaarxiv icon