Yihao Zhang

Causal Abstraction in Model Interpretability: A Compact Survey

Oct 26, 2024
Via arXiv

MILE: A Mutation Testing Framework of In-Context Learning Systems

Sep 07, 2024
Via arXiv

ShapeICP: Iterative Category-level Object Pose and Shape Estimation from Depth

Aug 23, 2024
Via arXiv

Automata Extraction from Transformers

Jun 08, 2024
Via arXiv

Boosting Jailbreak Attack with Momentum

May 02, 2024
Via arXiv

Exploring the Robustness of In-Context Learning with Noisy Labels

May 01, 2024
Via arXiv

Towards General Conceptual Model Editing via Adversarial Representation Engineering

Apr 21, 2024
Via arXiv

On the Duality Between Sharpness-Aware Minimization and Adversarial Training

Feb 23, 2024
Via arXiv

Building a digital twin of EDFA: a grey-box modeling approach

Jul 13, 2023
Via arXiv

Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks

Jun 24, 2023
Via arXiv