Nabil Omi

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Jan 17, 2026
Via arXiv

Generative Modeling of Individual Behavior at Scale

Feb 20, 2025
Via arXiv

Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning

Oct 31, 2024
Via arXiv