Picture for Yeonju Ro

Yeonju Ro

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Add code
Oct 24, 2024
Figure 1 for Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Figure 2 for Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Figure 3 for Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Figure 4 for Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Viaarxiv icon

FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping

Add code
Apr 05, 2024
Viaarxiv icon

Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity

Add code
May 05, 2021
Figure 1 for Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity
Figure 2 for Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity
Figure 3 for Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity
Figure 4 for Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity
Viaarxiv icon

Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization

Add code
May 05, 2021
Figure 1 for Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Figure 2 for Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Figure 3 for Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Figure 4 for Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Viaarxiv icon