Patrick Schramowski

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Jan 17, 2025

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Dec 19, 2024

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Nov 11, 2024

Core Tokensets for Data-efficient Sequential Training of Transformers

Oct 08, 2024

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning

Jul 03, 2024

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Jun 27, 2024

LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment

Jun 07, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024

ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Apr 06, 2024

DeiSAM: Segment Anything with Deictic Prompting

Feb 21, 2024