Picture for Fangjun Li

Fangjun Li

Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning

Add code
May 23, 2024
Viaarxiv icon

Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark

Add code
Jan 08, 2024
Figure 1 for Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark
Figure 2 for Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark
Figure 3 for Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark
Figure 4 for Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark
Viaarxiv icon

Exploring the GLIDE model for Human Action-effect Prediction

Add code
Aug 01, 2022
Figure 1 for Exploring the GLIDE model for Human Action-effect Prediction
Figure 2 for Exploring the GLIDE model for Human Action-effect Prediction
Figure 3 for Exploring the GLIDE model for Human Action-effect Prediction
Figure 4 for Exploring the GLIDE model for Human Action-effect Prediction
Viaarxiv icon