Xuan Shen

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Jan 08, 2025
Via arXiv

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024
Via arXiv

Numerical Pruning for Efficient Autoregressive Models

Dec 17, 2024
Via arXiv

Fully Open Source Moxin-7B Technical Report

Dec 08, 2024
Via arXiv

A Survey of Small Language Models

Oct 25, 2024
Via arXiv

Pruning Foundation Models for High Accuracy without Retraining

Oct 21, 2024
Via arXiv

Rethinking Token Reduction for State Space Models

Oct 16, 2024
Via arXiv

Exploring Token Pruning in Vision State Space Models

Sep 27, 2024
Via arXiv

Search for Efficient Large Language Models

Sep 25, 2024
Via arXiv

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge

Feb 16, 2024
Via arXiv