Yiqiao Zhong

How Does Unfaithful Reasoning Emerge from Autoregressive Training? A Study of Synthetic Experiments

Feb 01, 2026
Via arXiv

Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic

Jan 30, 2026
Via arXiv

Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning

May 24, 2025
Via arXiv

Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective

Oct 22, 2024
Via arXiv

How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes

Apr 04, 2024
Via arXiv

Uncovering hidden geometry in Transformers via disentangling position and context

Oct 07, 2023
Via arXiv

Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage

Jun 06, 2023
Via arXiv

Tractability from overparametrization: The example of the negative perceptron

Oct 28, 2021
Via arXiv

The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training

Jul 25, 2020
Via arXiv

A Selective Overview of Deep Learning

Apr 15, 2019
Via arXiv