Picture for Xiaobin Hu

Xiaobin Hu

Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation

Add code
May 28, 2026
Viaarxiv icon

What Semantics Survive the Connector? Diagnosing VLM-to-DiT Alignment in Video Editing

Add code
May 20, 2026
Viaarxiv icon

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Add code
May 19, 2026
Viaarxiv icon

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset

Add code
May 19, 2026
Viaarxiv icon

SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents

Add code
May 18, 2026
Viaarxiv icon

VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection

Add code
May 11, 2026
Viaarxiv icon

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Add code
May 07, 2026
Viaarxiv icon

SkillGraph: Self-Evolving Multi-Agent Collaboration with Multimodal Graph Topology

Add code
Apr 19, 2026
Viaarxiv icon

Evo-MedAgent: Beyond One-Shot Diagnosis with Agents That Remember, Reflect, and Improve

Add code
Apr 15, 2026
Viaarxiv icon

Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations

Add code
Apr 14, 2026
Viaarxiv icon