Picture for Ivan Evtimov

Ivan Evtimov

Jack

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

Add code
Dec 13, 2024
Viaarxiv icon

Persistent Pre-Training Poisoning of LLMs

Add code
Oct 17, 2024
Viaarxiv icon

Gradient-based Jailbreak Images for Multimodal Fusion Models

Add code
Oct 04, 2024
Viaarxiv icon

Automated Red Teaming with GOAT: the Generative Offensive Agent Tester

Add code
Oct 02, 2024
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

Add code
Apr 16, 2024
Viaarxiv icon

Towards Red Teaming in Multimodal and Multilingual Translation

Add code
Jan 29, 2024
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

Add code
Dec 07, 2023
Figure 1 for Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
Figure 2 for Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
Figure 3 for Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
Figure 4 for Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
Viaarxiv icon

VPA: Fully Test-Time Visual Prompt Adaptation

Add code
Sep 26, 2023
Viaarxiv icon