Picture for Yisen Wang

Yisen Wang

Dissecting the Failure of Invariant Learning on Graphs

Add code
Nov 05, 2024
Viaarxiv icon

What is Wrong with Perplexity for Long-context Language Modeling?

Add code
Oct 31, 2024
Viaarxiv icon

Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness

Add code
Oct 27, 2024
Viaarxiv icon

Can In-context Learning Really Generalize to Out-of-distribution Tasks?

Add code
Oct 13, 2024
Viaarxiv icon

AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation

Add code
Oct 11, 2024
Figure 1 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 2 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 3 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 4 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Viaarxiv icon

EKAN: Equivariant Kolmogorov-Arnold Networks

Add code
Oct 01, 2024
Viaarxiv icon

TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors

Add code
Sep 09, 2024
Figure 1 for TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors
Figure 2 for TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors
Figure 3 for TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors
Figure 4 for TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors
Viaarxiv icon

CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation

Add code
Jul 16, 2024
Viaarxiv icon

On the Role of Discrete Tokenization in Visual Representation Learning

Add code
Jul 12, 2024
Viaarxiv icon

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

Add code
Jul 01, 2024
Viaarxiv icon