Serwan Jassim

iVISPAR -- An Interactive Visual-Spatial Reasoning Benchmark for VLMs

Feb 05, 2025

GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models

Nov 15, 2023