Picture for Zhiru Zhang

Zhiru Zhang

ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models

Add code
Jun 24, 2024
Viaarxiv icon

Differentiable Combinatorial Scheduling at Scale

Add code
Jun 06, 2024
Figure 1 for Differentiable Combinatorial Scheduling at Scale
Figure 2 for Differentiable Combinatorial Scheduling at Scale
Figure 3 for Differentiable Combinatorial Scheduling at Scale
Figure 4 for Differentiable Combinatorial Scheduling at Scale
Viaarxiv icon

Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

Add code
May 06, 2024
Viaarxiv icon

Allo: A Programming Model for Composable Accelerator Design

Add code
Apr 07, 2024
Viaarxiv icon

Radial Networks: Dynamic Layer Routing for High-Performance Large Language Models

Add code
Apr 07, 2024
Viaarxiv icon

UniSparse: An Intermediate Language for General Sparse Format Customization

Add code
Mar 09, 2024
Viaarxiv icon

Less is More: Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits

Add code
Mar 06, 2024
Viaarxiv icon

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time

Add code
Mar 02, 2024
Figure 1 for Polynormer: Polynomial-Expressive Graph Transformer in Linear Time
Figure 2 for Polynormer: Polynomial-Expressive Graph Transformer in Linear Time
Figure 3 for Polynormer: Polynomial-Expressive Graph Transformer in Linear Time
Figure 4 for Polynormer: Polynomial-Expressive Graph Transformer in Linear Time
Viaarxiv icon

Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel

Add code
Feb 21, 2024
Viaarxiv icon

SAGMAN: Stability Analysis of Graph Neural Networks on the Manifolds

Add code
Feb 21, 2024
Viaarxiv icon