Picture for Cunxiao Du

Cunxiao Du

SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Add code
Oct 17, 2024
Figure 1 for SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction
Figure 2 for SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction
Figure 3 for SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction
Figure 4 for SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction
Viaarxiv icon

When Attention Sink Emerges in Language Models: An Empirical View

Add code
Oct 14, 2024
Viaarxiv icon

Reverse Modeling in Large Language Models

Add code
Oct 13, 2024
Viaarxiv icon

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Add code
Oct 09, 2024
Viaarxiv icon

Efficient Inference for Large Language Model-based Generative Recommendation

Add code
Oct 07, 2024
Figure 1 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 2 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 3 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 4 for Efficient Inference for Large Language Model-based Generative Recommendation
Viaarxiv icon

Revisiting the Markov Property for Machine Translation

Add code
Feb 03, 2024
Viaarxiv icon

GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Add code
Feb 03, 2024
Viaarxiv icon

ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Add code
Oct 08, 2022
Figure 1 for ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 2 for ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 3 for ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 4 for ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Viaarxiv icon

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Add code
Jun 09, 2021
Figure 1 for Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 2 for Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 3 for Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Figure 4 for Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Viaarxiv icon

Explicit Interaction Model towards Text Classification

Add code
Nov 23, 2018
Figure 1 for Explicit Interaction Model towards Text Classification
Figure 2 for Explicit Interaction Model towards Text Classification
Figure 3 for Explicit Interaction Model towards Text Classification
Figure 4 for Explicit Interaction Model towards Text Classification
Viaarxiv icon