Picture for Hidenori Tanaka

Hidenori Tanaka

Forking Paths in Neural Text Generation

Add code
Dec 10, 2024
Viaarxiv icon

Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Add code
Dec 01, 2024
Viaarxiv icon

Continuous-Time Analysis of Adaptive Optimization and Normalization

Add code
Nov 08, 2024
Figure 1 for Continuous-Time Analysis of Adaptive Optimization and Normalization
Figure 2 for Continuous-Time Analysis of Adaptive Optimization and Normalization
Figure 3 for Continuous-Time Analysis of Adaptive Optimization and Normalization
Figure 4 for Continuous-Time Analysis of Adaptive Optimization and Normalization
Viaarxiv icon

Representation Shattering in Transformers: A Synthetic Study with Knowledge Editing

Add code
Oct 22, 2024
Viaarxiv icon

Dynamics of Concept Learning and Compositional Generalization

Add code
Oct 10, 2024
Figure 1 for Dynamics of Concept Learning and Compositional Generalization
Figure 2 for Dynamics of Concept Learning and Compositional Generalization
Figure 3 for Dynamics of Concept Learning and Compositional Generalization
Figure 4 for Dynamics of Concept Learning and Compositional Generalization
Viaarxiv icon

A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language

Add code
Aug 22, 2024
Viaarxiv icon

Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

Add code
Jun 27, 2024
Viaarxiv icon

Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model

Add code
Feb 12, 2024
Viaarxiv icon

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Add code
Nov 21, 2023
Viaarxiv icon

How Capable Can a Transformer Become? A Study on Synthetic, Interpretable Tasks

Add code
Nov 21, 2023
Viaarxiv icon