Minglei Li

SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing

Sep 05, 2024
Via arXiv

Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation

Aug 03, 2024
Via arXiv

Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

Jun 06, 2024
Via arXiv

Multi-Channel Multi-Step Spectrum Prediction Using Transformer and Stacked Bi-LSTM

May 29, 2024
Via arXiv

Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Apr 02, 2024
Via arXiv

Syllable based DNN-HMM Cantonese Speech to Text System

Feb 13, 2024
Via arXiv

Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers

Dec 25, 2023
Via arXiv

TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

Dec 23, 2023
Via arXiv

Practical Deep Dispersed Watermarking with Synchronization and Fusion

Oct 23, 2023
Via arXiv

UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons

Sep 13, 2023
Via arXiv