Yanzhi Wang

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Jan 08, 2025
Via arXiv

All-in-One Tuning and Structural Pruning for Domain-Specific LLMs

Dec 19, 2024
Via arXiv

Numerical Pruning for Efficient Autoregressive Models

Dec 17, 2024
Via arXiv

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024
Via arXiv

SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device

Dec 13, 2024
Via arXiv

Open-Source Acceleration of Stable-Diffusion.cpp

Dec 08, 2024
Via arXiv

Fully Open Source Moxin-7B Technical Report

Dec 08, 2024
Via arXiv

Domain Adaptation-based Edge Computing for Cross-Conditions Fault Diagnosis

Nov 15, 2024
Via arXiv

Fast and Memory-Efficient Video Diffusion Using Streamlined Inference

Nov 02, 2024
Via arXiv

Pruning Foundation Models for High Accuracy without Retraining

Oct 21, 2024
Via arXiv