Picture for Futian Andrew Wei

Futian Andrew Wei

From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards

Add code
Mar 21, 2024
Viaarxiv icon