Zhiwen Fan

How Independent are Large Language Models? A Statistical Framework for Auditing Behavioral Entanglement and Reweighting Verifier Ensembles

Apr 08, 2026

SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning

Mar 28, 2026

NavTrust: Benchmarking Trustworthiness for Embodied Navigation

Mar 19, 2026

NanoGS: Training-Free Gaussian Splat Simplification

Mar 17, 2026

Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

Mar 13, 2026

Egocentric World Model for Photorealistic Hand-Object Interaction Synthesis

Mar 13, 2026

RuleSmith: Multi-Agent LLMs for Automated Game Balancing

Feb 05, 2026

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Jul 16, 2025

Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions

Jul 10, 2025

CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy

Jun 06, 2025