Picture for Lei Zhu

Lei Zhu

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

Add code
Sep 26, 2025
Viaarxiv icon

Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method

Add code
Sep 19, 2025
Viaarxiv icon

HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation

Add code
Sep 18, 2025
Viaarxiv icon

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Add code
Aug 10, 2025
Viaarxiv icon

EventRR: Event Referential Reasoning for Referring Video Object Segmentation

Add code
Aug 10, 2025
Viaarxiv icon

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Add code
Aug 09, 2025
Viaarxiv icon

Unified modality separation: A vision-language framework for unsupervised domain adaptation

Add code
Aug 07, 2025
Viaarxiv icon

HRVVS: A High-resolution Video Vasculature Segmentation Network via Hierarchical Autoregressive Residual Priors

Add code
Jul 30, 2025
Viaarxiv icon

MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training

Add code
Jul 23, 2025
Viaarxiv icon

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Add code
Jul 10, 2025
Viaarxiv icon