Mario Fritz

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Feb 28, 2025

Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models

Feb 27, 2025

MaxSup: Overcoming Representation Collapse in Label Smoothing

Feb 18, 2025

Safety is Essential for Responsible Open-Ended Systems

Feb 06, 2025

DocMIA: Document-Level Membership Inference Attacks against DocVQA Models

Feb 06, 2025

Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment

Feb 04, 2025

COMIX: Compositional Explanations using Prototypes

Jan 10, 2025

BiCert: A Bilinear Mixed Integer Programming Formulation for Precise Certified Bounds Against Data Poisoning Attacks

Dec 13, 2024

DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators

Dec 03, 2024

In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models

Nov 25, 2024