Ameya Prabhu

Michael Pokorny

PostTrainBench: Can LLM Agents Automate LLM Post-Training?

Mar 10, 2026
Via arXiv

Modular Memory is the Key to Continual Learning Agents

Mar 02, 2026
Via arXiv

Intrinsic Credit Assignment for Long Horizon Interaction

Feb 12, 2026
Via arXiv

Scaling Open-Ended Reasoning to Predict the Future

Dec 31, 2025
Via arXiv

Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity

Oct 31, 2025
Via arXiv

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Oct 10, 2025
Via arXiv

VGGSounder: Audio-Visual Evaluations for Foundation Models

Aug 12, 2025
Via arXiv

Answer Matching Outperforms Multiple Choice for Language Model Evaluation

Jul 03, 2025
Via arXiv

Are We Done with Object-Centric Learning?

Apr 09, 2025
Via arXiv

A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Apr 09, 2025
Via arXiv