Sliding Window Attention


STILL: Selecting Tokens for Intra-Layer Hybrid Attention to Linearize LLMs

Feb 02, 2026
Via arXiv

More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression

Feb 02, 2026
Via arXiv

Power-based Partial Attention: Bridging Linear-Complexity and Full Attention

Jan 27, 2026
Via arXiv

Window-Diffusion: Accelerating Diffusion Language Model Inference with Windowed Token Pruning and Caching

Jan 28, 2026
Via arXiv

EEG-Titans: Long-Horizon Seizure Forecasting via Dual-Branch Attention and Neural Memory

Jan 20, 2026
Via arXiv

Forecasting Continuum Intensity for Solar Active Region Emergence Prediction using Transformers

Jan 19, 2026
Via arXiv

LiQSS: Post-Transformer Linear Quantum-Inspired State-Space Tensor Networks for Real-Time 6G

Jan 18, 2026
Via arXiv

MiMo-V2-Flash Technical Report

Jan 08, 2026
Via arXiv

UDPNet: Unleashing Depth-based Priors for Robust Image Dehazing

Jan 11, 2026
Via arXiv

Spectral-Window Hybrid (SWH)

Jan 04, 2026
Via arXiv