Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Edward Golob

Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Oct 02, 2020

Arun Das, Jeffrey Mock, Henry Chacon, Farzan Irani, Edward Golob, Peyman Najafirad

Figure 1 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Figure 2 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Figure 3 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Figure 4 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Abstract:Speech disorders such as stuttering disrupt the normal fluency of speech by involuntary repetitions, prolongations and blocking of sounds and syllables. In addition to these disruptions to speech fluency, most adults who stutter (AWS) also experience numerous observable secondary behaviors before, during, and after a stuttering moment, often involving the facial muscles. Recent studies have explored automatic detection of stuttering using Artificial Intelligence (AI) based algorithm from respiratory rate, audio, etc. during speech utterance. However, most methods require controlled environments and/or invasive wearable sensors, and are unable explain why a decision (fluent vs stuttered) was made. We hypothesize that pre-speech facial activity in AWS, which can be captured non-invasively, contains enough information to accurately classify the upcoming utterance as either fluent or stuttered. Towards this end, this paper proposes a novel explainable AI (XAI) assisted convolutional neural network (CNN) classifier to predict near future stuttering by learning temporal facial muscle movement patterns of AWS and explains the important facial muscles and actions involved. Statistical analyses reveal significantly high prevalence of cheek muscles (p<0.005) and lip muscles (p<0.005) to predict stuttering and shows a behavior conducive of arousal and anticipation to speak. The temporal study of these upper and lower facial muscles may facilitate early detection of stuttering, promote automated assessment of stuttering and have application in behavioral therapies by providing automatic non-invasive feedback in realtime.

* Submitting to IEEE Trans. 10 pages, 7 figures. Final Manuscript

Via

Access Paper or Ask Questions