Picture for Hong Mei

Hong Mei

Why language models collapse when trained on recursively generated text

Add code
Dec 19, 2024
Figure 1 for Why language models collapse when trained on recursively generated text
Figure 2 for Why language models collapse when trained on recursively generated text
Figure 3 for Why language models collapse when trained on recursively generated text
Figure 4 for Why language models collapse when trained on recursively generated text
Viaarxiv icon

LoRA Dropout as a Sparsity Regularizer for Overfitting Control

Add code
Apr 15, 2024
Figure 1 for LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Figure 2 for LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Figure 3 for LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Figure 4 for LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Viaarxiv icon

Exploring the Potential of Large Language Models in Graph Generation

Add code
Mar 21, 2024
Viaarxiv icon

Improving Code Generation by Dynamic Temperature Sampling

Add code
Sep 06, 2023
Viaarxiv icon

DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training

Add code
Dec 31, 2021
Figure 1 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 2 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 3 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Figure 4 for DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training
Viaarxiv icon

Massive Self-Assembly in Grid Environments

Add code
Feb 23, 2021
Figure 1 for Massive Self-Assembly in Grid Environments
Figure 2 for Massive Self-Assembly in Grid Environments
Figure 3 for Massive Self-Assembly in Grid Environments
Viaarxiv icon

SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud

Add code
Dec 07, 2020
Figure 1 for SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
Figure 2 for SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
Figure 3 for SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
Figure 4 for SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud
Viaarxiv icon