Zile Zhou

Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Mar 14, 2025

EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Oct 12, 2024

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

Dec 27, 2021