Picture for Jirong Zha

Jirong Zha

How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM

Add code
Apr 08, 2025
Viaarxiv icon

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Add code
Mar 08, 2025
Viaarxiv icon

A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability

Add code
Apr 03, 2024
Viaarxiv icon