Steve Bakos

AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages

Feb 18, 2025
Via arXiv

Noise as a Double-Edged Sword: Reinforcement Learning Exploits Randomized Defenses in Neural Networks

Oct 31, 2024
Via arXiv