Han Shi

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Dec 16, 2024

Efficient Multi-modal Large Language Models via Visual Token Grouping

Nov 26, 2024

DAPE V2: Process Attention Score as Feature Map for Length Extrapolation

Oct 07, 2024

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

Oct 02, 2024

QuickLLaMA: Query-aware Inference Acceleration for Large Language Models

Jun 11, 2024

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

May 23, 2024

CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

May 23, 2024

On the Expressive Power of a Variant of the Looped Transformer

Feb 21, 2024

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Feb 12, 2024

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Oct 12, 2023