Picture for Arun Suggala

Arun Suggala

Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

Add code
Dec 04, 2024
Viaarxiv icon

Time-Reversal Provides Unsupervised Feedback to LLMs

Add code
Dec 03, 2024
Viaarxiv icon

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health Program

Add code
Oct 28, 2024
Viaarxiv icon

Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD

Add code
Oct 26, 2024
Viaarxiv icon

Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning

Add code
Mar 08, 2024
Viaarxiv icon

Second Order Methods for Bandit Optimization and Control

Add code
Feb 14, 2024
Viaarxiv icon

Responsible AI (RAI) Games and Ensembles

Add code
Oct 28, 2023
Viaarxiv icon