Picture for Oam Patel

Oam Patel

Designing a Dashboard for Transparency and Control of Conversational AI

Add code
Jun 12, 2024
Viaarxiv icon

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

Add code
Mar 08, 2024
Viaarxiv icon

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Add code
Jun 07, 2023
Viaarxiv icon