Picture for Ali Jadbabaie

Ali Jadbabaie

Improved Sample Complexity of Imitation Learning for Barrier Model Predictive Control

Add code
Oct 01, 2024
Viaarxiv icon

Residual Connections and Normalization Can Provably Prevent Oversmoothing in GNNs

Add code
Jun 05, 2024
Viaarxiv icon

On the Role of Attention Masks and LayerNorm in Transformers

Add code
May 29, 2024
Figure 1 for On the Role of Attention Masks and LayerNorm in Transformers
Figure 2 for On the Role of Attention Masks and LayerNorm in Transformers
Figure 3 for On the Role of Attention Masks and LayerNorm in Transformers
Figure 4 for On the Role of Attention Masks and LayerNorm in Transformers
Viaarxiv icon

A least-square method for non-asymptotic identification in linear switching control

Add code
Apr 11, 2024
Viaarxiv icon

Belief Samples Are All You Need For Social Learning

Add code
Mar 25, 2024
Viaarxiv icon

Linear attention is (maybe) all you need (to understand transformer optimization)

Add code
Oct 02, 2023
Figure 1 for Linear attention is (maybe) all you need (to understand transformer optimization)
Figure 2 for Linear attention is (maybe) all you need (to understand transformer optimization)
Figure 3 for Linear attention is (maybe) all you need (to understand transformer optimization)
Figure 4 for Linear attention is (maybe) all you need (to understand transformer optimization)
Viaarxiv icon

Convex and Non-Convex Optimization under Generalized Smoothness

Add code
Jun 02, 2023
Figure 1 for Convex and Non-Convex Optimization under Generalized Smoothness
Figure 2 for Convex and Non-Convex Optimization under Generalized Smoothness
Figure 3 for Convex and Non-Convex Optimization under Generalized Smoothness
Viaarxiv icon

Smooth Model Predictive Control with Applications to Statistical Learning

Add code
Jun 02, 2023
Viaarxiv icon

How to escape sharp minima

Add code
May 25, 2023
Viaarxiv icon

Demystifying Oversmoothing in Attention-Based Graph Neural Networks

Add code
May 25, 2023
Figure 1 for Demystifying Oversmoothing in Attention-Based Graph Neural Networks
Figure 2 for Demystifying Oversmoothing in Attention-Based Graph Neural Networks
Figure 3 for Demystifying Oversmoothing in Attention-Based Graph Neural Networks
Viaarxiv icon