László Tóth

Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks

May 31, 2023
Via arXiv

Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging

Jul 26, 2021
Via arXiv

Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input

Jul 05, 2021
Via arXiv

Voice Activity Detection for Ultrasound-based Silent Speech Interfaces using Convolutional Neural Networks

Jun 03, 2021
Via arXiv

Improving Neural Silent Speech Interface Models by Adversarial Training

Apr 23, 2021
Via arXiv

Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders

Apr 23, 2021
Via arXiv

3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces

Apr 23, 2021
Via arXiv

Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks

Aug 07, 2020
Via arXiv

GMM-Free Flat Start Sequence-Discriminative DNN Training

Oct 11, 2016
Via arXiv