Picture for Lingkun Long

Lingkun Long

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Add code
Feb 02, 2026
Viaarxiv icon

SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning

Add code
Aug 08, 2025
Viaarxiv icon

TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices

Add code
Nov 03, 2023
Figure 1 for TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Figure 2 for TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Figure 3 for TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Figure 4 for TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Viaarxiv icon