Picture for K R Prajwal

K R Prajwal

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Add code
Oct 27, 2024
Figure 1 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 2 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 3 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 4 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Viaarxiv icon

A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision

Add code
May 16, 2024
Figure 1 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 2 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 3 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Figure 4 for A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Viaarxiv icon

Weakly-supervised Fingerspelling Recognition in British Sign Language Videos

Add code
Nov 16, 2022
Viaarxiv icon

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild

Add code
Sep 01, 2022
Figure 1 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 2 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 3 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Figure 4 for Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Viaarxiv icon

Automatic dense annotation of large-vocabulary sign language videos

Add code
Aug 04, 2022
Figure 1 for Automatic dense annotation of large-vocabulary sign language videos
Figure 2 for Automatic dense annotation of large-vocabulary sign language videos
Figure 3 for Automatic dense annotation of large-vocabulary sign language videos
Figure 4 for Automatic dense annotation of large-vocabulary sign language videos
Viaarxiv icon

Visual Keyword Spotting with Attention

Add code
Oct 29, 2021
Figure 1 for Visual Keyword Spotting with Attention
Figure 2 for Visual Keyword Spotting with Attention
Figure 3 for Visual Keyword Spotting with Attention
Figure 4 for Visual Keyword Spotting with Attention
Viaarxiv icon

Visual Speech Enhancement Without A Real Visual Stream

Add code
Dec 20, 2020
Figure 1 for Visual Speech Enhancement Without A Real Visual Stream
Figure 2 for Visual Speech Enhancement Without A Real Visual Stream
Figure 3 for Visual Speech Enhancement Without A Real Visual Stream
Figure 4 for Visual Speech Enhancement Without A Real Visual Stream
Viaarxiv icon

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

Add code
Aug 23, 2020
Figure 1 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 2 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 3 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Figure 4 for A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Viaarxiv icon

Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis

Add code
May 17, 2020
Figure 1 for Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Figure 2 for Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Figure 3 for Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Figure 4 for Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Viaarxiv icon