Picture for Sharath Adavanne

Sharath Adavanne

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Add code
Jun 15, 2023
Figure 1 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 2 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 3 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 4 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Viaarxiv icon

Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts

Add code
Nov 04, 2022
Viaarxiv icon

Context-based out-of-vocabulary word recovery for ASR systems in Indian languages

Add code
Jun 09, 2022
Figure 1 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 2 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 3 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Figure 4 for Context-based out-of-vocabulary word recovery for ASR systems in Indian languages
Viaarxiv icon

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

Add code
Jun 04, 2022
Figure 1 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 2 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 3 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 4 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Viaarxiv icon

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

Add code
Oct 29, 2021
Figure 1 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Figure 2 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Figure 3 for Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Viaarxiv icon

A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection

Add code
Jul 04, 2021
Figure 1 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 2 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 3 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Figure 4 for A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Viaarxiv icon

Non-native English lexicon creation for bilingual speech synthesis

Add code
Jun 21, 2021
Figure 1 for Non-native English lexicon creation for bilingual speech synthesis
Figure 2 for Non-native English lexicon creation for bilingual speech synthesis
Figure 3 for Non-native English lexicon creation for bilingual speech synthesis
Figure 4 for Non-native English lexicon creation for bilingual speech synthesis
Viaarxiv icon

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

Add code
Apr 29, 2019
Figure 1 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 2 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 3 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Figure 4 for Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
Viaarxiv icon

Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network

Add code
Aug 05, 2018
Figure 1 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 2 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 3 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Figure 4 for Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
Viaarxiv icon

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features

Add code
Jan 29, 2018
Figure 1 for Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features
Figure 2 for Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features
Figure 3 for Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features
Figure 4 for Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features
Viaarxiv icon