Longxiang Wang

CALM: Curiosity-Driven Auditing for Large Language Models

Jan 06, 2025
Via arXiv