Picture for Jiahe Jin

Jiahe Jin

Revisiting 3D LLM Benchmarks: Are We Really Testing 3D Capabilities?

Add code
Feb 12, 2025
Viaarxiv icon

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Add code
Dec 23, 2024
Viaarxiv icon

BeHonest: Benchmarking Honesty of Large Language Models

Add code
Jun 19, 2024
Viaarxiv icon