Picture for Lu Yin

Lu Yin

The Curse of Depth in Large Language Models

Add code
Feb 09, 2025
Viaarxiv icon

Sebra: Debiasing Through Self-Guided Bias Ranking

Add code
Jan 30, 2025
Figure 1 for Sebra: Debiasing Through Self-Guided Bias Ranking
Figure 2 for Sebra: Debiasing Through Self-Guided Bias Ranking
Figure 3 for Sebra: Debiasing Through Self-Guided Bias Ranking
Figure 4 for Sebra: Debiasing Through Self-Guided Bias Ranking
Viaarxiv icon

Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN

Add code
Dec 18, 2024
Viaarxiv icon

Aspect-Based Few-Shot Learning

Add code
Dec 17, 2024
Viaarxiv icon

Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning

Add code
Nov 26, 2024
Figure 1 for Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning
Figure 2 for Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning
Figure 3 for Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning
Figure 4 for Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning
Viaarxiv icon

Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning

Add code
Nov 21, 2024
Figure 1 for Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning
Figure 2 for Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning
Figure 3 for Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning
Figure 4 for Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning
Viaarxiv icon

OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework

Add code
Nov 12, 2024
Viaarxiv icon

Multimodal Contrastive Learning of Urban Space Representations from POI Data

Add code
Nov 09, 2024
Viaarxiv icon

TODO: Enhancing LLM Alignment with Ternary Preferences

Add code
Nov 02, 2024
Figure 1 for TODO: Enhancing LLM Alignment with Ternary Preferences
Figure 2 for TODO: Enhancing LLM Alignment with Ternary Preferences
Figure 3 for TODO: Enhancing LLM Alignment with Ternary Preferences
Figure 4 for TODO: Enhancing LLM Alignment with Ternary Preferences
Viaarxiv icon

Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models

Add code
Oct 10, 2024
Figure 1 for Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models
Figure 2 for Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models
Figure 3 for Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models
Figure 4 for Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models
Viaarxiv icon