Picture for Mateusz Dziemian

Mateusz Dziemian

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Add code
Oct 11, 2024
Viaarxiv icon

Applying Refusal-Vector Ablation to Llama 3.1 70B Agents

Add code
Oct 08, 2024
Viaarxiv icon