Picture for Chaoyue Liu

Chaoyue Liu

Toward High-Performance Energy and Power Battery Cells with Machine Learning-based Optimization of Electrode Manufacturing

Add code
Jul 07, 2023
Viaarxiv icon

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Add code
Jun 07, 2023
Viaarxiv icon

On Emergence of Clean-Priority Learning in Early Stopped Neural Networks

Add code
Jun 05, 2023
Viaarxiv icon

Aiming towards the minimizers: fast convergence of SGD for overparametrized problems

Add code
Jun 05, 2023
Viaarxiv icon

ReLU soothes the NTK condition number and accelerates optimization for wide neural networks

Add code
May 15, 2023
Viaarxiv icon

Quadratic models for understanding neural network dynamics

Add code
May 24, 2022
Figure 1 for Quadratic models for understanding neural network dynamics
Figure 2 for Quadratic models for understanding neural network dynamics
Figure 3 for Quadratic models for understanding neural network dynamics
Figure 4 for Quadratic models for understanding neural network dynamics
Viaarxiv icon

Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture

Add code
May 24, 2022
Figure 1 for Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture
Viaarxiv icon

Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models

Add code
Mar 10, 2022
Figure 1 for Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models
Figure 2 for Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models
Figure 3 for Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models
Viaarxiv icon

Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization

Add code
Dec 08, 2021
Figure 1 for Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization
Figure 2 for Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization
Figure 3 for Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization
Figure 4 for Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization
Viaarxiv icon

On the linearity of large non-linear models: when and why the tangent kernel is constant

Add code
Oct 02, 2020
Figure 1 for On the linearity of large non-linear models: when and why the tangent kernel is constant
Figure 2 for On the linearity of large non-linear models: when and why the tangent kernel is constant
Figure 3 for On the linearity of large non-linear models: when and why the tangent kernel is constant
Viaarxiv icon