Picture for Xiaobin Hu

Xiaobin Hu

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection

Add code
Dec 23, 2025
Viaarxiv icon

Memory in the Age of AI Agents

Add code
Dec 15, 2025
Viaarxiv icon

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10$\times$

Add code
Dec 15, 2025
Viaarxiv icon

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Add code
Dec 15, 2025
Viaarxiv icon

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Add code
Nov 14, 2025
Viaarxiv icon

OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research

Add code
Oct 30, 2025
Viaarxiv icon

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Add code
Jul 02, 2025
Viaarxiv icon

Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion

Add code
May 28, 2025
Viaarxiv icon

Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning

Add code
May 26, 2025
Figure 1 for Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning
Figure 2 for Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning
Figure 3 for Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning
Figure 4 for Align and Surpass Human Camouflaged Perception: Visual Refocus Reinforcement Fine-Tuning
Viaarxiv icon

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation

Add code
Apr 25, 2025
Viaarxiv icon