Aaron Kirtland

Humanity's Last Exam

Jan 24, 2025
Via arXiv

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Jul 10, 2024
Via arXiv

REBUS: A Robust Evaluation Benchmark of Understanding Symbols

Jan 11, 2024
Via arXiv

Inverse Scaling: When Bigger Isn't Better

Jun 15, 2023
Via arXiv