Picture for Yiwu Yao

Yiwu Yao

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Add code
Jul 22, 2024
Viaarxiv icon

Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs

Add code
Oct 17, 2023
Viaarxiv icon

Extremely Low Footprint End-to-End ASR System for Smart Device

Add code
Apr 26, 2021
Figure 1 for Extremely Low Footprint End-to-End ASR System for Smart Device
Figure 2 for Extremely Low Footprint End-to-End ASR System for Smart Device
Figure 3 for Extremely Low Footprint End-to-End ASR System for Smart Device
Figure 4 for Extremely Low Footprint End-to-End ASR System for Smart Device
Viaarxiv icon

INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices

Add code
Oct 28, 2020
Figure 1 for INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Figure 2 for INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Figure 3 for INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Figure 4 for INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Viaarxiv icon

Fully Parallel Architecture for Semi-global Stereo Matching with Refined Rank Method

Add code
May 07, 2019
Figure 1 for Fully Parallel Architecture for Semi-global Stereo Matching with Refined Rank Method
Figure 2 for Fully Parallel Architecture for Semi-global Stereo Matching with Refined Rank Method
Figure 3 for Fully Parallel Architecture for Semi-global Stereo Matching with Refined Rank Method
Figure 4 for Fully Parallel Architecture for Semi-global Stereo Matching with Refined Rank Method
Viaarxiv icon

Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices

Add code
May 06, 2019
Figure 1 for Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices
Figure 2 for Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices
Figure 3 for Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices
Figure 4 for Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices
Viaarxiv icon