Mario Fritz

Certified Circuits: Stability Guarantees for Mechanistic Circuits

Feb 26, 2026

Scalable Delphi: Large Language Models for Structured Risk Estimation

Feb 09, 2026

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

Feb 08, 2026

Funny or Persuasive, but Not Both: Evaluating Fine-Grained Multi-Concept Control in LLMs

Jan 26, 2026

Probe-based Fine-tuning for Reducing Toxicity

Oct 24, 2025

Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches

Aug 29, 2025

Deepfake Detection that Generalizes Across Benchmarks

Aug 08, 2025

Pixel-level Certified Explanations via Randomized Smoothing

Jun 18, 2025

ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols

Jun 09, 2025

Stealix: Model Stealing via Prompt Evolution

Jun 06, 2025