Picture for Zhen Lei

Zhen Lei

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation

Add code
Mar 20, 2025
Viaarxiv icon

Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport

Add code
Mar 19, 2025
Viaarxiv icon

Bayesian Test-Time Adaptation for Vision-Language Models

Add code
Mar 12, 2025
Viaarxiv icon

SRM-Hair: Single Image Head Mesh Reconstruction via 3D Morphable Hair

Add code
Mar 08, 2025
Viaarxiv icon

EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery

Add code
Jan 20, 2025
Figure 1 for EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
Figure 2 for EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
Figure 3 for EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
Figure 4 for EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
Viaarxiv icon

WMamba: Wavelet-based Mamba for Face Forgery Detection

Add code
Jan 16, 2025
Figure 1 for WMamba: Wavelet-based Mamba for Face Forgery Detection
Figure 2 for WMamba: Wavelet-based Mamba for Face Forgery Detection
Figure 3 for WMamba: Wavelet-based Mamba for Face Forgery Detection
Figure 4 for WMamba: Wavelet-based Mamba for Face Forgery Detection
Viaarxiv icon

CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection

Add code
Jan 11, 2025
Figure 1 for CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Figure 2 for CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Figure 3 for CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Figure 4 for CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Viaarxiv icon

Concept Discovery in Deep Neural Networks for Explainable Face Anti-Spoofing

Add code
Dec 23, 2024
Viaarxiv icon

RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection

Add code
Dec 17, 2024
Viaarxiv icon

Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data

Add code
Dec 02, 2024
Viaarxiv icon