Feilong Chen

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

May 06, 2024
Via arXiv

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder

Nov 03, 2023
Via arXiv

ViLaS: Integrating Vision and Language into Automatic Speech Recognition

May 31, 2023
Via arXiv

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

May 10, 2023
Via arXiv

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation

Jan 30, 2023
Via arXiv

An Online Sparse Streaming Feature Selection Algorithm

Aug 03, 2022
Via arXiv

HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval

May 31, 2022
Via arXiv

Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning

Apr 15, 2022
Via arXiv

VLP: A Survey on Vision-Language Pre-training

Feb 21, 2022
Via arXiv

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Sep 17, 2021
Via arXiv