Picture for Grace Sodunke

Grace Sodunke

GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

Add code
Jun 07, 2024
Viaarxiv icon

VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution

Add code
Jun 21, 2023
Figure 1 for VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Figure 2 for VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Figure 3 for VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Figure 4 for VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Viaarxiv icon