Genghan Zhang

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Jul 05, 2024
Via arXiv

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Jun 21, 2024
Via arXiv

CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models

Apr 12, 2024
Via arXiv

GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU

Apr 08, 2024
Via arXiv

Canvas: End-to-End Kernel Architecture Search in Neural Networks

Apr 18, 2023
Via arXiv