Ahmadreza Argha

xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking

Jan 30, 2025
Via arXiv

ETAGE: Enhanced Test Time Adaptation with Integrated Entropy and Gradient Norms for Robust Model Performance

Sep 14, 2024
Via arXiv

Revolutionizing Genomics with Reinforcement Learning Techniques

Feb 26, 2023
Via arXiv