Picture for Yuyang Ma

Yuyang Ma

Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions

Add code
Feb 08, 2025
Viaarxiv icon