Hamed Hassani

Safety Guardrails for LLM-Enabled Robots

Mar 10, 2025

Adaptively evaluating models with task elicitation

Mar 03, 2025

Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts

Feb 18, 2025

Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents

Feb 04, 2025

On The Concurrence of Layer-wise Preconditioning Methods and Provable Feature Learning

Feb 03, 2025

Adversarial Reasoning at Jailbreaking Time

Feb 03, 2025

Asymptotics of Linear Regression with Linearly Dependent Data

Dec 04, 2024

Conformal Risk Minimization with Variance Reduction

Nov 03, 2024

Jailbreaking LLM-Controlled Robots

Oct 17, 2024

Watermark Smoothing Attacks against Language Models

Jul 19, 2024