Xiaobin Hu

MambaADv2: Evolving Duality-enhanced State Space Model for Unsupervised Anomaly Detection

Jun 22, 2026
Via arXiv

SPOT-E: Test-Time Entropy Shaping with Visual Spotlights for Frozen VLMs

Jun 18, 2026
Via arXiv

OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation

Jun 16, 2026
Via arXiv

Last But Not Least: Boundary Attention CalibratiON for Multimodal KV Cache Compression

Jun 16, 2026
Via arXiv

TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation

Jun 10, 2026
Via arXiv

Audio Interaction Model

Jun 03, 2026
Via arXiv

JAVEDIT: Joint Audio-Visual Instruction-Guided Video Editing with Agentic Data Curation

Jun 02, 2026
Via arXiv

Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation

May 28, 2026
Via arXiv

What Semantics Survive the Connector? Diagnosing VLM-to-DiT Alignment in Video Editing

May 20, 2026
Via arXiv

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset

May 19, 2026
Via arXiv