Cheolhyoung Lee

Unsupervised Learning of Initialization in Deep Neural Networks via Maximum Mean Discrepancy

Feb 08, 2023
Via arXiv

A Non-monotonic Self-terminating Language Model

Oct 03, 2022
Via arXiv

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

Sep 25, 2019
Via arXiv

Directional Analysis of Stochastic Gradient Descent via von Mises-Fisher Distributions in Deep learning

Sep 29, 2018
Via arXiv