Chi-Heng Lin

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

Oct 15, 2024
Via arXiv

MoDeGPT: Modular Decomposition for Large Language Model Compression

Aug 20, 2024
Via arXiv

DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling

May 01, 2024
Via arXiv

Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

Feb 18, 2024
Via arXiv

Half-Hop: A graph upsampling approach for slowing down message passing

Aug 17, 2023
Via arXiv

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

Oct 10, 2022
Via arXiv

Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Łojasiewicz Functions when the Non-Convexity is Averaged-Out

Jun 22, 2022
Via arXiv

Drop, Swap, and Generate: A Self-Supervised Approach for Generating Neural Activity

Nov 03, 2021
Via arXiv

Escaping Saddle Points Faster with Stochastic Momentum

Jun 05, 2021
Via arXiv

Mine Your Own vieW: Self-Supervised Learning Through Across-Sample Prediction

Feb 19, 2021
Via arXiv