Picture for Xianglin Yang

Xianglin Yang

Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning

Add code
Jan 31, 2025
Figure 1 for Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning
Figure 2 for Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning
Figure 3 for Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning
Figure 4 for Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning
Viaarxiv icon

Exploring the Evolution of Hidden Activations with Live-Update Visualization

Add code
May 24, 2024
Viaarxiv icon

DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training

Add code
Dec 31, 2021
Figure 1 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 2 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 3 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 4 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Viaarxiv icon