Picture for Giovanni Morrone

Giovanni Morrone

A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR

Add code
Sep 09, 2024
Viaarxiv icon

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech

Add code
Jun 13, 2024
Viaarxiv icon

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

Add code
May 29, 2023
Viaarxiv icon

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Add code
Mar 21, 2023
Viaarxiv icon

Conversational Speech Separation: an Evaluation Study for Streaming Applications

Add code
May 31, 2022
Figure 1 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 2 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 3 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 4 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Viaarxiv icon

Leveraging Speech Separation for Conversational Telephone Speaker Diarization

Add code
Apr 05, 2022
Figure 1 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 2 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 3 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 4 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Viaarxiv icon

Audio-Visual Speech Inpainting with Deep Learning

Add code
Oct 09, 2020
Figure 1 for Audio-Visual Speech Inpainting with Deep Learning
Figure 2 for Audio-Visual Speech Inpainting with Deep Learning
Figure 3 for Audio-Visual Speech Inpainting with Deep Learning
Viaarxiv icon

Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras

Add code
Dec 05, 2019
Figure 1 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Figure 2 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Figure 3 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Viaarxiv icon

Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses

Add code
Apr 16, 2019
Figure 1 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 2 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 3 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 4 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Viaarxiv icon

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

Add code
Nov 06, 2018
Figure 1 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 2 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 3 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 4 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Viaarxiv icon