Huanqi Cao

SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 28, 2024

Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention

Jun 17, 2024

PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR

Jul 11, 2023

RWKV: Reinventing RNNs for the Transformer Era

May 22, 2023

CPM: A Large-scale Generative Chinese Pre-trained Language Model

Dec 01, 2020