Picture for Yuan Cao

Yuan Cao

Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei, China, Shanghai Branch, CAS Center for Excellence in Quantum Information and Quantum Physics, University of Science and Technology of China, Shanghai, China, Shanghai Research Center for Quantum Sciences, Shanghai, China

Transformers Simulate MLE for Sequence Generation in Bayesian Networks

Add code
Jan 05, 2025
Figure 1 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 2 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 3 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Figure 4 for Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Viaarxiv icon

Learning Spectral Methods by Transformers

Add code
Jan 05, 2025
Figure 1 for Learning Spectral Methods by Transformers
Figure 2 for Learning Spectral Methods by Transformers
Figure 3 for Learning Spectral Methods by Transformers
Figure 4 for Learning Spectral Methods by Transformers
Viaarxiv icon

Towards Simple and Provable Parameter-Free Adaptive Gradient Methods

Add code
Dec 27, 2024
Viaarxiv icon

On the Feature Learning in Diffusion Models

Add code
Dec 02, 2024
Viaarxiv icon

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Add code
Nov 16, 2024
Viaarxiv icon

On the Comparison between Multi-modal and Single-modal Contrastive Learning

Add code
Nov 05, 2024
Viaarxiv icon

Global Convergence in Training Large-Scale Transformers

Add code
Oct 31, 2024
Figure 1 for Global Convergence in Training Large-Scale Transformers
Figure 2 for Global Convergence in Training Large-Scale Transformers
Viaarxiv icon

Initialization Matters: On the Benign Overfitting of Two-Layer ReLU CNN with Fully Trainable Layers

Add code
Oct 24, 2024
Viaarxiv icon

Understanding the Benefits of SimCLR Pre-Training in Two-Layer Convolutional Neural Networks

Add code
Sep 27, 2024
Viaarxiv icon

Improving Fast Adversarial Training via Self-Knowledge Guidance

Add code
Sep 26, 2024
Viaarxiv icon