Picture for Yongqiang Cai

Yongqiang Cai

Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition

Add code
Feb 28, 2025
Viaarxiv icon

Neural Networks Trained by Weight Permutation are Universal Approximators

Add code
Jul 01, 2024
Viaarxiv icon

A Minimal Control Family of Dynamical Syetem for Universal Approximation

Add code
Dec 20, 2023
Viaarxiv icon

Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation

Add code
May 29, 2023
Viaarxiv icon

Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions

Add code
May 20, 2023
Viaarxiv icon

Achieve the Minimum Width of Neural Networks for Universal Approximation

Add code
Sep 23, 2022
Figure 1 for Achieve the Minimum Width of Neural Networks for Universal Approximation
Figure 2 for Achieve the Minimum Width of Neural Networks for Universal Approximation
Figure 3 for Achieve the Minimum Width of Neural Networks for Universal Approximation
Figure 4 for Achieve the Minimum Width of Neural Networks for Universal Approximation
Viaarxiv icon

Vanilla feedforward neural networks as a discretization of dynamic systems

Add code
Sep 22, 2022
Figure 1 for Vanilla feedforward neural networks as a discretization of dynamic systems
Figure 2 for Vanilla feedforward neural networks as a discretization of dynamic systems
Figure 3 for Vanilla feedforward neural networks as a discretization of dynamic systems
Viaarxiv icon

Optimization in Machine Learning: A Distribution Space Approach

Add code
Apr 18, 2020
Figure 1 for Optimization in Machine Learning: A Distribution Space Approach
Figure 2 for Optimization in Machine Learning: A Distribution Space Approach
Figure 3 for Optimization in Machine Learning: A Distribution Space Approach
Figure 4 for Optimization in Machine Learning: A Distribution Space Approach
Viaarxiv icon

On the Convergence and Robustness of Batch Normalization

Add code
Sep 29, 2018
Figure 1 for On the Convergence and Robustness of Batch Normalization
Figure 2 for On the Convergence and Robustness of Batch Normalization
Figure 3 for On the Convergence and Robustness of Batch Normalization
Figure 4 for On the Convergence and Robustness of Batch Normalization
Viaarxiv icon