Picture for Ningxin Zheng

Ningxin Zheng

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Add code
Oct 28, 2024
Viaarxiv icon

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion

Add code
Jun 12, 2024
Figure 1 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 2 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 3 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 4 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Viaarxiv icon

EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention

Add code
May 11, 2023
Viaarxiv icon

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference

Add code
Mar 15, 2023
Viaarxiv icon

Online Video Streaming Super-Resolution with Adaptive Look-Up Table Fusion

Add code
Mar 01, 2023
Viaarxiv icon

SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation

Add code
Jan 26, 2023
Viaarxiv icon

Online Video Super-Resolution with Convolutional Kernel Bypass Graft

Add code
Aug 04, 2022
Figure 1 for Online Video Super-Resolution with Convolutional Kernel Bypass Graft
Figure 2 for Online Video Super-Resolution with Convolutional Kernel Bypass Graft
Figure 3 for Online Video Super-Resolution with Convolutional Kernel Bypass Graft
Figure 4 for Online Video Super-Resolution with Convolutional Kernel Bypass Graft
Viaarxiv icon

Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision

Add code
Aug 30, 2021
Figure 1 for Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Figure 2 for Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Figure 3 for Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Figure 4 for Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Viaarxiv icon