UnivIS
Information system of Friedrich-Alexander-University Erlangen-Nuremberg © Config eG 
FAU Logo
  Collection/class schedule    module collection Home  |  Legal Matters  |  Contact  |  Help    
search:      semester:   
 
 Layout
 
printable version

 
 
Analysis and Evaluation of Tracheoesophageal Substitute Voices (SVcheck)

Tracheoesophageal (TE) speech is a possibility to restore the ability to speak after laryngectomy, i.e. the removal of the larynx. A shunt valve between the trachea and the esophagus can divert the air stream from the lungs to the pharyngoesophageal segment where tissue vibrations are the source of the substitute voice. During therapy the patient's voice has to be evaluated from time to time. Changes in criteria like volume, intelligibility and use of prosody have to be found and reported. The evaluation is subjectively and the method is time-consuming and expensive for doctor and patient. Therefore an automatic and objective method would be desirable.

In our work we examine how TE speech is recognized by an automatic speech recognition system and if the quality of the substitute voice can (partially) be evaluated automatically. For this goal the evaluation results of the automatic system and a human expert group have to correlate. The patients' self-evaluation (SF-36, V-RQOL, VHI, Trierer Skalen) will also be part of an automatically computed measure which describes the quality of the substitute voice.

The speech data collection from the patients does not only take into account spontaneous speech from a close-talking microphone but also recordings via telephone.

Up to now we found a high correlation between the raters' intelligibility score for the patients' continuous speech and the word accuracy computed by an automatic speech recognizer both for close-talk recordings (r=-0.88) and for telephone speech (r=-0.80). Similar correlations could be obtained between automatically computed prosodic features and the rating criteria "match of breath and sense units" and "speaking effort". The visualization of speech disorders was successfully done using the Sammon transform.

Project manager:
Prof. Dr. med. Frank Rosanowski

Project participants:
Prof. Dr. med., Dr. rer. nat. Ulrich Eysholdt, im Ruhestand, Prof. Dr.-Ing. Elmar Nöth, Dr.-Ing. Dr. habil. med. Tino Haderlein

Keywords:
laryngectomy; substitute voice; automatic speech processing

Duration: 1.4.2005 - 31.3.2007

Sponsored by:
Deutsche Krebshilfe

Mitwirkende Institutionen:
Abteilung für Phoniatrie und Pädaudiologie
Lehrstuhl für Mustererkennung

Contact:
Nöth, Elmar
Phone +49 9131 85 27888, Fax +49 9131 85 27270, E-Mail: elmar.noeth@fau.de
Publications
Schuster, Maria ; Haderlein, Tino ; Nöth, Elmar ; Lohscheller, Jörg ; Eysholdt, Ulrich ; Rosanowski, Frank: Intelligibility of laryngectomees' substitute speech: automatic speech recognition and subjective rating. In: Eur Arch Otorhinolaryngol 263 (2006), No. 2, pp 188-193
Haderlein, Tino ; Nöth, Elmar ; Schuster, Maria ; Eysholdt, Ulrich ; Rosanowski, Frank: Evaluation of Tracheoesophageal Substitute Voices Using Prosodic Features. In: Hoffmann, Rüdiger ; Mixdorff, Hansjörg (Ed.) : Proc. Speech Prosody, 3rd International Conference (Speech Prosody, 3rd International Conference Dresden 2.5.-5.5.2006). Vol. 1. Dresden : TUDpress, 2006, pp 701-704. - ISBN 3-938863-57-9
Haderlein, Tino ; Zorn, Dominik ; Steidl, Stefan ; Nöth, Elmar ; Shozakai, Makoto ; Schuster, Maria: Visualization of Voice Disorders Using the Sammon Transform. In: Sojka, Petr ; Kopecek, Ivan ; Pala, Karel (Ed.) : Proc. Text, Speech and Dialogue; 9th International Conference (Text, Speech and Dialogue; 9th International Conference (TSD 2006) Brno, Tschechien 11.9.-15.9.2006). Vol. 1. Berlin : Springer, 2006, pp 589-596. (Lecture Notes in Artificial Intelligence) - ISBN 3-540-39090-1
Riedhammer, Korbinian ; Haderlein, Tino ; Schuster, Maria ; Rosanowski, Frank ; Nöth, Elmar: Automatic Evaluation of Tracheoesophageal Telephone Speech. In: Erjavec, Tomaz ; Zganec Gros, Jerneja (Ed.) : Proceedings of the 5th Slovenian and 1st International Conference Language Technologies IS-LTC 2006 (5th Slovenian and 1st International Conference Language Technologies IS-LTC 2006 Ljubljana, Slowenien 9.10.-10.10.2006). Vol. 1. Ljubljana, Slowenien : Biografika BORI d.o.o., 2006, pp 17-22. - ISBN 978-961-6303-83-5
Haderlein, Tino ; Riedhammer, Korbinian ; Maier, Andreas ; Nöth, Elmar ; Toy, Hikmet ; Rosanowski, Frank: An Automatic Version of the Post-Laryngectomy Telephone Test. In: Matousek, Vaclav ; Mautner, Pavel (Ed.) : Proc. Text, Speech and Dialogue; 10th International Conference (Text, Speech and Dialogue; 10th International Conference (TSD 2007) Pilsen, Tschechien 3.-7.9.2007). Vol. 1. Berlin : Springer, 2007, pp 238-245. (Lecture Notes in Artificial Intelligence Vol. 4629) - ISBN 978-3-540-74627-0
Haderlein, Tino ; Nöth, Elmar ; Toy, Hikmet ; Batliner, Anton ; Schuster, Maria ; Eysholdt, Ulrich ; Hornegger, Joachim ; Rosanowski, Frank: Automatic Evaluation of Prosodic Features of Tracheoesophageal Substitute Voice. In: Eur Arch Otorhinolaryngol 264 (2007), No. 11, pp 1315-1321

Institution: Chair of Computer Science 5 (Pattern Recognition)
UnivIS is a product of Config eG, Buckenhof