Wilson Wu

Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations

Oct 09, 2024
Via arXiv

Do language models plan ahead for future tokens?

Apr 01, 2024
Via arXiv

Learning Deterministic Finite Automata from Confidence Oracles

Nov 18, 2023
Via arXiv

Generating Semantic Adversarial Examples with Differentiable Rendering

Oct 02, 2019
Via arXiv