Picture for Ahmed Ahmed

Ahmed Ahmed

Event-VStream: Event-Driven Real-Time Understanding for Long Video Streams

Add code
Jan 22, 2026
Viaarxiv icon

Quantifying the Effect of Test Set Contamination on Generative Evaluations

Add code
Jan 07, 2026
Viaarxiv icon

Extracting books from production language models

Add code
Jan 06, 2026
Viaarxiv icon

Measurement to Meaning: A Validity-Centered Framework for AI Evaluation

Add code
May 13, 2025
Viaarxiv icon

Independence Tests for Language Models

Add code
Feb 17, 2025
Viaarxiv icon