Tokenization


Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

Add code
Feb 05, 2026
Viaarxiv icon

AP-OOD: Attention Pooling for Out-of-Distribution Detection

Add code
Feb 05, 2026
Viaarxiv icon

Transformers Are Born Biased: Structural Inductive Biases at Random Initialization and Their Practical Consequences

Add code
Feb 05, 2026
Viaarxiv icon

Verification of the Implicit World Model in a Generative Model via Adversarial Sequences

Add code
Feb 05, 2026
Viaarxiv icon

Focus-Scan-Refine: From Human Visual Perception to Efficient Visual Token Pruning

Add code
Feb 05, 2026
Viaarxiv icon

Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance

Add code
Feb 05, 2026
Viaarxiv icon

Towards Green AI: Decoding the Energy of LLM Inference in Software Development

Add code
Feb 05, 2026
Viaarxiv icon

GLASS: A Generative Recommender for Long-sequence Modeling via SID-Tier and Semantic Search

Add code
Feb 05, 2026
Viaarxiv icon

Shiva-DiT: Residual-Based Differentiable Top-$k$ Selection for Efficient Diffusion Transformers

Add code
Feb 05, 2026
Viaarxiv icon