Matthias Bethge

Are We Done with Object-Centric Learning?

Apr 09, 2025
Via arXiv

A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Apr 09, 2025
Via arXiv

Understanding the Limits of Lifelong Knowledge Editing in LLMs

Mar 07, 2025
Via arXiv

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Feb 26, 2025
Via arXiv

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Feb 26, 2025
Via arXiv

Testing the limits of fine-tuning to improve reasoning in vision language models

Feb 21, 2025
Via arXiv

LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws

Feb 17, 2025
Via arXiv

Great Models Think Alike and this Undermines AI Oversight

Feb 06, 2025
Via arXiv

How to Merge Your Multimodal Models Over Time?

Dec 09, 2024
Via arXiv

ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities

Dec 09, 2024
Via arXiv