Picture for Lei Zhao

Lei Zhao

Soochow University

RoboBERT: An End-to-end Multimodal Robotic Manipulation Model

Add code
Feb 11, 2025
Viaarxiv icon

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Add code
Feb 11, 2025
Viaarxiv icon

UniForm: A Unified Diffusion Transformer for Audio-Video Generation

Add code
Feb 08, 2025
Viaarxiv icon

MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer

Add code
Jan 07, 2025
Figure 1 for MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer
Figure 2 for MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer
Figure 3 for MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer
Figure 4 for MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer
Viaarxiv icon

Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction

Add code
Dec 23, 2024
Viaarxiv icon

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot

Add code
Dec 03, 2024
Viaarxiv icon

Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey

Add code
Nov 30, 2024
Figure 1 for Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Figure 2 for Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Figure 3 for Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Figure 4 for Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Viaarxiv icon

PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images

Add code
Sep 17, 2024
Figure 1 for PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images
Figure 2 for PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images
Figure 3 for PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images
Figure 4 for PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images
Viaarxiv icon

CogVLM2: Visual Language Models for Image and Video Understanding

Add code
Aug 29, 2024
Figure 1 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 2 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 3 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 4 for CogVLM2: Visual Language Models for Image and Video Understanding
Viaarxiv icon

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

Add code
Aug 24, 2024
Viaarxiv icon