Ahmad Beirami

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Nov 27, 2024

Generalization Error of the Tilted Empirical Risk

Sep 28, 2024

Inducing Group Fairness in LLM-Based Decisions

Jun 24, 2024

Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Jun 10, 2024

Robust Preference Optimization through Reward Model Distillation

May 29, 2024

Mitigating Object Hallucination via Data Augmented Contrastive Tuning

May 28, 2024

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

Apr 18, 2024

Asymptotics of Language Model Alignment

Apr 02, 2024

Optimal Block-Level Draft Verification for Accelerating Speculative Decoding

Mar 15, 2024

Gradient-Based Language Model Red Teaming

Jan 30, 2024