Abstract: The 2010 Silent Speech Challenge benchmark is updated with new results obtained using a Deep Learning strategy, with the same input features and decoding strategy as in the original article. A Word Error Rate of 6.4% is obtained, compared to the published value of 17.4%. Additional results are also presented, comparing new auto-encoder-based features with the original features at reduced dimensionality, as well as decoding scenarios using two different language models. The Silent Speech Challenge archive has been updated to contain both the original and the new auto-encoder features, in addition to the original raw data.
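For reference, the Word Error Rate quoted above is the standard edit-distance metric over word sequences. The sketch below shows how it is typically computed; this is illustrative code, not the challenge's scoring pipeline.

```python
# Minimal sketch of Word Error Rate (WER) computation via word-level edit distance.
def wer(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    # Dynamic-programming table of edit distances between word prefixes.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution or match
    return d[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat", "the cat sat down"))  # 1 insertion / 3 words = 0.33
```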
Abstract: This article describes the development of a platform designed to visualize the 3D motion of the tongue from ultrasound image sequences. An overview of the system design is given, and promising results are presented. Compared to the analysis of motion in 2D image sequences, such a system provides additional visual information and a quantitative description of the tongue's 3D motion. The platform can be useful in a variety of fields, such as speech production research and articulation training.
Abstract: This article describes a contour-based 3D tongue-deformation visualization framework using B-mode ultrasound image sequences. A robust, automatic tracking algorithm characterizes tongue motion via a contour, which is then used to drive a generic 3D Finite Element Model (FEM). A novel contour-based 3D dynamic modeling method is presented, in which modal reduction and modal warping techniques are applied to model the deformation of the tongue physically and efficiently. This work can be helpful in a variety of fields, such as speech production research, silent speech recognition, articulation training, and the study of speech disorders.
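For context, modal reduction projects the full finite-element equations of motion onto a small set of vibration modes, which makes time integration cheap. The formulation below is the standard textbook one, written in our own notation rather than the article's.

```latex
% Full FEM equations of motion over n degrees of freedom, with mass M,
% damping C, stiffness K, displacements u, and external forces f:
\begin{align}
  M \ddot{u} + C \dot{u} + K u &= f \\
% Modal reduction approximates u in the basis of the first m eigenmodes
% \Phi = [\phi_1, \ldots, \phi_m], with m \ll n:
  u &\approx \Phi q \\
  \Phi^{\top} M \Phi \, \ddot{q} + \Phi^{\top} C \Phi \, \dot{q}
    + \Phi^{\top} K \Phi \, q &= \Phi^{\top} f
\end{align}
% With mass-normalized modes and proportional damping, the reduced matrices
% are diagonal, so each modal coordinate q_i evolves independently; modal
% warping then corrects the linear modal displacements for large rotations.
```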
Abstract: Studying tongue motion during speech using ultrasound is a standard procedure, but automatic labelling of ultrasound images remains a challenge, as standard tongue-shape extraction methods typically require human intervention. This article presents a method based on deep neural networks to automatically extract the tongue contour from ultrasound images in a speech dataset. We use a deep autoencoder trained to learn the relationship between an image and its associated contour, so that the model can automatically reconstruct the contour from the ultrasound image alone. Instead of time-consuming hand-labelling, we use an automatic labelling algorithm during training, and we evaluate the performance of both the automatic labelling and the contour extraction against hand-labelled references. The observed quality scores are comparable to the state of the art.
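Below is a minimal sketch of the kind of image-to-contour autoencoder described above, assuming a convolutional encoder-decoder trained to map an ultrasound frame to a contour map. The layer sizes, the heatmap-style target, and the loss are illustrative assumptions, not the authors' architecture.

```python
# Sketch of an image-to-contour autoencoder (illustrative, not the paper's model).
import torch
import torch.nn as nn

class ContourAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder: compress the 1-channel ultrasound frame into a latent map.
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, stride=2, padding=1),   # H/2 x W/2
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),  # H/4 x W/4
            nn.ReLU(),
        )
        # Decoder: expand back to a 1-channel contour probability map.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, kernel_size=2, stride=2),    # H/2 x W/2
            nn.ReLU(),
            nn.ConvTranspose2d(16, 1, kernel_size=2, stride=2),     # H x W
            nn.Sigmoid(),  # per-pixel probability of lying on the contour
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# One training step: the target is a contour map produced by the automatic
# labelling algorithm (standing in for hand-labelled contours); dummy data here.
model = ContourAutoencoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCELoss()

frames = torch.rand(8, 1, 64, 64)                      # batch of ultrasound frames
contours = (torch.rand(8, 1, 64, 64) > 0.95).float()   # binary contour maps

loss = loss_fn(model(frames), contours)
loss.backward()
optimizer.step()
```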