Ameya Prabhu

Michael Pokorny

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Feb 26, 2025
Via arXiv

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Feb 26, 2025
Via arXiv

Great Models Think Alike and this Undermines AI Oversight

Feb 06, 2025
Via arXiv

Humanity's Last Exam

Jan 24, 2025
Via arXiv

Open Problems in Machine Unlearning for AI Safety

Jan 09, 2025
Via arXiv

ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities

Dec 09, 2024
Via arXiv

How to Merge Your Multimodal Models Over Time?

Dec 09, 2024
Via arXiv

Delta-Influence: Unlearning Poisons via Influence Functions

Nov 20, 2024
Via arXiv

A Practitioner's Guide to Continual Multimodal Pretraining

Aug 26, 2024
Via arXiv

Data Contamination Report from the 2024 CONDA Shared Task

Jul 31, 2024
Via arXiv