Picture for Chi-Heng Lin

Chi-Heng Lin

ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning

Add code
Jan 25, 2025
Viaarxiv icon

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing

Add code
Jan 24, 2025
Viaarxiv icon

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

MoDeGPT: Modular Decomposition for Large Language Model Compression

Add code
Aug 20, 2024
Viaarxiv icon

DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling

Add code
May 01, 2024
Figure 1 for DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Figure 2 for DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Figure 3 for DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Figure 4 for DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Viaarxiv icon

Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

Add code
Feb 18, 2024
Figure 1 for Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Figure 2 for Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Figure 3 for Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Figure 4 for Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Viaarxiv icon

Half-Hop: A graph upsampling approach for slowing down message passing

Add code
Aug 17, 2023
Figure 1 for Half-Hop: A graph upsampling approach for slowing down message passing
Figure 2 for Half-Hop: A graph upsampling approach for slowing down message passing
Figure 3 for Half-Hop: A graph upsampling approach for slowing down message passing
Figure 4 for Half-Hop: A graph upsampling approach for slowing down message passing
Viaarxiv icon

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

Add code
Oct 10, 2022
Figure 1 for The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective
Figure 2 for The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective
Figure 3 for The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective
Figure 4 for The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective
Viaarxiv icon

Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Łojasiewicz Functions when the Non-Convexity is Averaged-Out

Add code
Jun 22, 2022
Figure 1 for Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Łojasiewicz Functions when the Non-Convexity is Averaged-Out
Figure 2 for Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Łojasiewicz Functions when the Non-Convexity is Averaged-Out
Figure 3 for Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Łojasiewicz Functions when the Non-Convexity is Averaged-Out
Viaarxiv icon

Drop, Swap, and Generate: A Self-Supervised Approach for Generating Neural Activity

Add code
Nov 03, 2021
Viaarxiv icon