Kuangzhi Ge

TC-IDM: Grounding Video Generation for Executable Zero-shot Robot Motion

Jan 26, 2026
Via arXiv

PhysicsMind: Sim and Real Mechanics Benchmarking for Physical Reasoning and Prediction in Foundational VLMs and World Models

Jan 22, 2026
Via arXiv

Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test

Jan 07, 2026
Via arXiv

Can World Models Benefit VLMs for World Dynamics?

Oct 01, 2025
Via arXiv

WoW: Towards a World omniscient World model Through Embodied Interaction

Sep 26, 2025
Via arXiv

MinD: Unified Visual Imagination and Control via Hierarchical World Models

Jun 23, 2025
Via arXiv

SCBench: A Sports Commentary Benchmark for Video LLMs

Dec 23, 2024
Via arXiv