Picture for Zhenyu Zhang

Zhenyu Zhang

MARS-Dragonfly: Agile and Robust Flight Control of Modular Aerial Robot Systems

Add code
Apr 07, 2026
Viaarxiv icon

CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models

Add code
Apr 06, 2026
Viaarxiv icon

MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding

Add code
Mar 30, 2026
Viaarxiv icon

Sparse Growing Transformer: Training-Time Sparse Depth Allocation via Progressive Attention Looping

Add code
Mar 25, 2026
Viaarxiv icon

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

Add code
Mar 24, 2026
Viaarxiv icon

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Add code
Mar 18, 2026
Viaarxiv icon

Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning

Add code
Mar 05, 2026
Viaarxiv icon

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

Add code
Mar 05, 2026
Viaarxiv icon

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage

Add code
Mar 01, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon