InterleaveThinker: Reinforcing Agentic Interleaved Generation

Jun 11, 2026
Via arXiv

Modality Forcing for Scalable Spatial Generation

Jun 11, 2026
Via arXiv

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Jun 11, 2026
Via arXiv

Flex4DHuman: Flexible Multi-view Video Diffusion for 4D Human Reconstruction

Jun 11, 2026
Via arXiv

World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible

Jun 11, 2026
Via arXiv

SkMTEB: Slovak Massive Text Embedding Benchmark and Model Adaptation

Jun 11, 2026
Via arXiv

From Tokens to Faces: Investigating Discrete Speech Representations for 3D Facial Animation

Jun 11, 2026
Via arXiv

Revisiting Vehicle Color Recognition in Long-Tailed Surveillance Scenarios

Jun 11, 2026
Via arXiv

The Tone of Awareness: Topic, Sentiment, and Toxicity Maps During Mental Health Month on TikTok

Jun 11, 2026
Via arXiv

Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models

Jun 11, 2026
Via arXiv