Picture for Susan Holm

Susan Holm

NoveltyBench: Evaluating Language Models for Humanlike Diversity

Add code
Apr 08, 2025
Viaarxiv icon

ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Add code
Oct 29, 2024
Viaarxiv icon

Formulation Comparison for Timeline Construction using LLMs

Add code
Mar 01, 2024
Viaarxiv icon