Triantafyllos Afouras

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Oct 27, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

Jul 17, 2023

Learning to Ground Instructional Articles in Videos through Narrations

Jun 06, 2023

Scaling up sign spotting through sign language dictionaries

May 09, 2022

Audio-Visual Synchronisation in the wild

Dec 08, 2021

BBC-Oxford British Sign Language Dataset

Nov 05, 2021

Visual Keyword Spotting with Attention

Oct 29, 2021

Sub-word Level Lip Reading With Visual Attention

Oct 14, 2021

Aligning Subtitles in Sign Language Videos

May 06, 2021