Picture for Yuan Cao

Yuan Cao

Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei, China, Shanghai Branch, CAS Center for Excellence in Quantum Information and Quantum Physics, University of Science and Technology of China, Shanghai, China, Shanghai Research Center for Quantum Sciences, Shanghai, China

On the Robustness of Transformers against Context Hijacking for Linear Classification

Add code
Feb 21, 2025
Viaarxiv icon

Transformers versus the EM Algorithm in Multi-class Clustering

Add code
Feb 09, 2025
Viaarxiv icon

Transformers and Their Roles as Time Series Foundation Models

Add code
Feb 05, 2025
Figure 1 for Transformers and Their Roles as Time Series Foundation Models
Figure 2 for Transformers and Their Roles as Time Series Foundation Models
Figure 3 for Transformers and Their Roles as Time Series Foundation Models
Figure 4 for Transformers and Their Roles as Time Series Foundation Models
Viaarxiv icon

Learning Spectral Methods by Transformers

Add code
Jan 05, 2025
Figure 1 for Learning Spectral Methods by Transformers
Figure 2 for Learning Spectral Methods by Transformers
Figure 3 for Learning Spectral Methods by Transformers
Figure 4 for Learning Spectral Methods by Transformers
Viaarxiv icon

Transformers Simulate MLE for Sequence Generation in Bayesian Networks

Add code
Jan 05, 2025
Figure 1 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 2 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 3 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 4 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Viaarxiv icon

Towards Simple and Provable Parameter-Free Adaptive Gradient Methods

Add code
Dec 27, 2024
Viaarxiv icon

On the Feature Learning in Diffusion Models

Add code
Dec 02, 2024
Viaarxiv icon

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Add code
Nov 16, 2024
Viaarxiv icon

On the Comparison between Multi-modal and Single-modal Contrastive Learning

Add code
Nov 05, 2024
Viaarxiv icon

Global Convergence in Training Large-Scale Transformers

Add code
Oct 31, 2024
Figure 1 for Global Convergence in Training Large-Scale Transformers
Figure 2 for Global Convergence in Training Large-Scale Transformers
Viaarxiv icon