Yudong Zhang

MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux

Jan 19, 2026, via arXiv

MGML: A Plug-and-Play Meta-Guided Multi-Modal Learning Framework for Incomplete Multimodal Brain Tumor Segmentation

Dec 30, 2025, via arXiv

Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation

Dec 18, 2025, via arXiv

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Nov 14, 2025, via arXiv

MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control

Oct 01, 2025, via arXiv

Cross-Modal Clustering-Guided Negative Sampling for Self-Supervised Joint Learning from Medical Images and Reports

Jun 13, 2025, via arXiv

DMAF-Net: An Effective Modality Rebalancing Framework for Incomplete Multi-Modal Medical Image Segmentation

Jun 13, 2025, via arXiv

SWDL: Stratum-Wise Difference Learning with Deep Laplacian Pyramid for Semi-Supervised 3D Intracranial Hemorrhage Segmentation

Jun 12, 2025, via arXiv

MSLAU-Net: A Hybird CNN-Transformer Network for Medical Image Segmentation

May 24, 2025, via arXiv

QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models

Apr 15, 2025, via arXiv