Picture for Grace Sodunke

Grace Sodunke

GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

Add code
Jun 07, 2024
Viaarxiv icon

VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution

Add code
Jun 21, 2023
Viaarxiv icon