Han Shi

DAPE V2: Process Attention Score as Feature Map for Length Extrapolation

Oct 07, 2024
Via arXiv

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

Oct 02, 2024
Via arXiv

QuickLLaMA: Query-aware Inference Acceleration for Large Language Models

Jun 11, 2024
Via arXiv

CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

May 23, 2024
Via arXiv

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

May 23, 2024
Via arXiv

On the Expressive Power of a Variant of the Looped Transformer

Feb 21, 2024
Via arXiv

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Feb 12, 2024
Via arXiv

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Oct 12, 2023
Via arXiv

Effective and Parameter-Efficient Reusing Fine-Tuned Models

Oct 04, 2023
Via arXiv

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Sep 22, 2023
Via arXiv