Chuanxin Tang

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Dec 05, 2024

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Apr 12, 2023

Streaming Video Model

Mar 30, 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Dec 13, 2022

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

Oct 24, 2022

An Anchor-Free Detector for Continuous Speech Keyword Spotting

Aug 09, 2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

Jun 28, 2022

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism

Jan 26, 2022

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Sep 12, 2021

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

Sep 12, 2021