Tokenization


Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning

Add code
Feb 03, 2026
Viaarxiv icon

Conformal Thinking: Risk Control for Reasoning on a Compute Budget

Add code
Feb 03, 2026
Viaarxiv icon

Context Compression via Explicit Information Transmission

Add code
Feb 03, 2026
Viaarxiv icon

UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining

Add code
Feb 03, 2026
Viaarxiv icon

Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators

Add code
Feb 03, 2026
Viaarxiv icon

Beyond Tokens: Semantic-Aware Speculative Decoding for Efficient Inference by Probing Internal States

Add code
Feb 03, 2026
Viaarxiv icon

Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems

Add code
Feb 03, 2026
Viaarxiv icon

Bringing Reasoning to Generative Recommendation Through the Lens of Cascaded Ranking

Add code
Feb 03, 2026
Viaarxiv icon

TodyComm: Task-Oriented Dynamic Communication for Multi-Round LLM-based Multi-Agent System

Add code
Feb 03, 2026
Viaarxiv icon

Instruction Anchors: Dissecting the Causal Dynamics of Modality Arbitration

Add code
Feb 03, 2026
Viaarxiv icon