Alexander Hewer

DFKI, MMCI

A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract

Apr 17, 2018

Alexander Hewer, Stefanie Wuhrer, Ingmar Steiner, Korin Richmond

Abstract: We present a multilinear statistical model of the human tongue that captures anatomical and tongue-pose-related shape variations separately. The model is derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech-related vocal tract configurations. The extraction is performed with a minimally supervised method based on an image segmentation approach and a template fitting technique. Furthermore, it uses image denoising to deal with possibly corrupt data, palate surface information reconstruction to handle palatal tongue contacts, and a bootstrap strategy to refine the obtained shapes. Our evaluation concludes that limiting the degrees of freedom for the anatomical and speech-related variations to 5 and 4, respectively, produces a model that can reliably register unknown data while avoiding overfitting effects. Furthermore, we show that it can be used to generate a plausible tongue animation by tracking sparse motion capture data.

* Computer Speech & Language 51 (2018) 68-92
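
To make the idea of separate anatomy and pose factors concrete, here is a minimal sketch of one common way to build a multilinear model: a truncated higher-order SVD of a speaker × pose × coordinate tensor. The abstract does not specify the decomposition, so this technique, the function names (`unfold`, `train_multilinear`, `generate`), and the random stand-in data are all assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def unfold(tensor, mode):
    """Flatten a 3-way tensor into a matrix whose rows index the given mode."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def train_multilinear(data, n_anatomy, n_pose):
    """data: (n_speakers, n_poses, n_coords) array of registered, flattened meshes."""
    mean = data.mean(axis=(0, 1))
    centered = data - mean
    # Leading left singular vectors of each unfolding span the speaker and pose modes.
    u_anatomy = np.linalg.svd(unfold(centered, 0), full_matrices=False)[0][:, :n_anatomy]
    u_pose = np.linalg.svd(unfold(centered, 1), full_matrices=False)[0][:, :n_pose]
    # Core tensor: training data projected onto both truncated bases.
    core = np.einsum("spc,sa,pq->aqc", centered, u_anatomy, u_pose)
    return mean, core, u_anatomy, u_pose

def generate(mean, core, w_anatomy, w_pose):
    """Synthesize one flattened mesh from an anatomy and a pose weight vector."""
    return mean + np.einsum("aqc,a,q->c", core, w_anatomy, w_pose)

# Synthetic stand-in data: 11 speakers (as in the abstract); 6 poses and
# 10 vertices (30 coordinates) are arbitrary illustrative sizes.
rng = np.random.default_rng(0)
data = rng.normal(size=(11, 6, 30))

# Truncate to 5 anatomy and 4 pose degrees of freedom, as the abstract recommends.
mean, core, u_a, u_p = train_multilinear(data, n_anatomy=5, n_pose=4)
approx = generate(mean, core, u_a[2], u_p[3])  # approximation of speaker 2, pose 3

# Without truncation, the training shapes are reproduced exactly.
mean_f, core_f, u_af, u_pf = train_multilinear(data, n_anatomy=11, n_pose=6)
exact = generate(mean_f, core_f, u_af[2], u_pf[3])
```

Keeping the two weight vectors separate is what would let a tracking application hold the anatomy weights fixed for a speaker while only the pose weights change over time.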

A statistical shape space model of the palate surface trained on 3D MRI scans of the vocal tract

Sep 04, 2015

Alexander Hewer, Ingmar Steiner, Timo Bolkart, Stefanie Wuhrer, Korin Richmond

Abstract: We describe a minimally-supervised method for computing a statistical shape space model of the palate surface. The model is created from a corpus of volumetric magnetic resonance imaging (MRI) scans collected from 12 speakers. We extract a 3D mesh of the palate from each speaker, then train the model using principal component analysis (PCA). The palate model is then tested using 3D MRI from another corpus and evaluated using a high-resolution optical scan. We find that the error is low even when only a handful of measured coordinates are available. In both cases, our approach yields promising results. It can be applied to extract the palate shape from MRI data, and could be useful to other analysis modalities, such as electromagnetic articulography (EMA) and ultrasound tongue imaging (UTI).

* Proceedings of the 18th International Congress of Phonetic Sciences, Aug 2015, Glasgow, United Kingdom. 2015, http://www.icphs2015.info/
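
The PCA step and the claim about fitting from a handful of coordinates can be sketched as follows. This is a minimal illustration under stated assumptions: meshes are already in vertex correspondence, the sparse fit is a plain least-squares solve (the abstract does not say which fitting procedure is used), and the function names and synthetic data are hypothetical.

```python
import numpy as np

def train_shape_model(meshes, n_components):
    """meshes: (n_speakers, n_vertices, 3) array of meshes in vertex correspondence."""
    flat = meshes.reshape(len(meshes), -1)
    mean = flat.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(flat - mean, full_matrices=False)
    return mean, vt[:n_components]

def fit_sparse(mean, basis, idx, observed):
    """Estimate a full flattened shape from a few observed coordinates.

    idx holds the positions of the observed values in the flattened shape.
    """
    design = basis[:, idx].T  # (n_observed, n_components)
    weights, *_ = np.linalg.lstsq(design, observed - mean[idx], rcond=None)
    return mean + weights @ basis

# Synthetic stand-in data: 12 "speakers" (as in the abstract), each a
# 10-vertex mesh drawn from a 2-dimensional shape space (arbitrary sizes).
rng = np.random.default_rng(0)
true_mean = rng.normal(size=30)
true_basis = rng.normal(size=(2, 30))
train = true_mean + rng.normal(size=(12, 2)) @ true_basis
mean, basis = train_shape_model(train.reshape(12, 10, 3), n_components=2)

# An unseen shape from the same space, observed at only six coordinates.
new_shape = true_mean + rng.normal(size=2) @ true_basis
idx = np.array([0, 4, 7, 11, 20, 29])
recon = fit_sparse(mean, basis, idx, new_shape[idx])
```

On this noise-free synthetic data the reconstruction matches the unseen shape; with real measurements, a regularized fit would likely be preferable to plain least squares.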
