Picture for Doug Downey

Doug Downey

Allen Institute for Artificial Intelligence, Northwestern University

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite

Add code
Oct 24, 2025
Viaarxiv icon

Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning

Add code
Aug 26, 2025
Viaarxiv icon

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Add code
Jul 01, 2025
Figure 1 for SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
Figure 2 for SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
Figure 3 for SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
Figure 4 for SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
Viaarxiv icon

Ai2 Scholar QA: Organized Literature Synthesis with Attribution

Add code
Apr 15, 2025
Viaarxiv icon

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Add code
Nov 21, 2024
Figure 1 for OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
Figure 2 for OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
Figure 3 for OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
Figure 4 for OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
Viaarxiv icon

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Add code
Jun 10, 2024
Figure 1 for SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
Figure 2 for SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
Figure 3 for SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
Figure 4 for SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
Viaarxiv icon

TOPICAL: TOPIC Pages AutomagicaLly

Add code
May 03, 2024
Figure 1 for TOPICAL: TOPIC Pages AutomagicaLly
Figure 2 for TOPICAL: TOPIC Pages AutomagicaLly
Figure 3 for TOPICAL: TOPIC Pages AutomagicaLly
Figure 4 for TOPICAL: TOPIC Pages AutomagicaLly
Viaarxiv icon

MARG: Multi-Agent Review Generation for Scientific Papers

Add code
Jan 08, 2024
Viaarxiv icon

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

Add code
Nov 19, 2023
Viaarxiv icon

CARE: Extracting Experimental Findings From Clinical Literature

Add code
Nov 16, 2023
Viaarxiv icon