Hao Zhang

refer to the report for detailed contributions

AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models

Sep 16, 2025

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Sep 16, 2025

Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models

Sep 11, 2025

Transferable Direct Prompt Injection via Activation-Guided MCMC Sampling

Sep 09, 2025

TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration

Aug 25, 2025

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing

Aug 20, 2025

Constructing Invariant and Equivariant Operations by Symmetric Tensor Network

Aug 18, 2025

Audio-Thinker: Guiding Audio Language Model When and How to Think via Reinforcement Learning

Aug 12, 2025

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Jul 30, 2025

Interactive Adversarial Testing of Autonomous Vehicles with Adjustable Confrontation Intensity

Jul 29, 2025