Picture for Lionel Levine

Lionel Levine

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

Add code
Nov 07, 2024
Viaarxiv icon

Exploring Cross-model Neuronal Correlations in the Context of Predicting Model Performance and Generalizability

Add code
Aug 15, 2024
Viaarxiv icon

Do language models plan ahead for future tokens?

Add code
Apr 01, 2024
Viaarxiv icon

A Self-supervised Framework for Improved Data-Driven Monitoring of Stress via Multi-modal Passive Sensing

Add code
Mar 24, 2023
Viaarxiv icon