Picture for Hongkang Li

Hongkang Li

Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

Add code
Oct 03, 2024
Viaarxiv icon

Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis

Add code
Jun 24, 2024
Viaarxiv icon

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

Add code
Jun 04, 2024
Figure 1 for What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Figure 2 for What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Figure 3 for What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Figure 4 for What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Viaarxiv icon

How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance

Add code
Mar 19, 2024
Viaarxiv icon

Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis

Add code
Feb 23, 2024
Viaarxiv icon

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $ε$-Greedy Exploration

Add code
Oct 24, 2023
Viaarxiv icon

How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context

Add code
Aug 26, 2023
Viaarxiv icon

A Theoretical Understanding of shallow Vision Transformers: Learning, Generalization, and Sample Complexity

Add code
Feb 12, 2023
Viaarxiv icon

Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data

Add code
Jul 07, 2022
Figure 1 for Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data
Figure 2 for Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data
Figure 3 for Learning and generalization of one-hidden-layer neural networks, going beyond standard Gaussian data
Viaarxiv icon

Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling

Add code
Jul 07, 2022
Figure 1 for Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling
Figure 2 for Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling
Figure 3 for Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling
Figure 4 for Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling
Viaarxiv icon