Sayash Kapoor

CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark

Sep 17, 2024
Via arXiv

The Foundation Model Transparency Index v1.1: May 2024

Jul 17, 2024
Via arXiv

AI Agents That Matter

Jul 01, 2024
Via arXiv

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Jun 26, 2024
Via arXiv

A Safe Harbor for AI Evaluation and Red Teaming

Mar 07, 2024
Via arXiv

On the Societal Impact of Open Foundation Models

Feb 27, 2024
Via arXiv

Foundation Model Transparency Reports

Feb 26, 2024
Via arXiv

The Foundation Model Transparency Index

Oct 19, 2023
Via arXiv

REFORMS: Reporting Standards for Machine Learning Based Science

Aug 15, 2023
Via arXiv

Leakage and the Reproducibility Crisis in ML-based Science

Jul 14, 2022
Via arXiv