Chen Gao

How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM

Apr 08, 2025

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Apr 06, 2025

Can LLMs Assist Computer Education? An Empirical Case Study of DeepSeek

Apr 01, 2025

AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction, Detection, and Analysis

Mar 27, 2025

GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection

Mar 26, 2025

Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Mar 14, 2025

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Mar 08, 2025

A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval

Mar 07, 2025

CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space

Feb 20, 2025

PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models

Feb 20, 2025