Statistical Analysis of the Relationship between Audio and Video Speech Parameters for Australian English

Authors: Roland Göcke and J. Bruce Millar
Presented by Roland Göcke at the ISCA Tutorial and Research Workshop on Auditory-Visual Speech Processing AVSP 2003, St. Jorioz, France, 4-7 September 2003

Abstract

After decades of research, automatic speech processing has become more and more viable in recent years. Audio-video speech recognition has been shown to improve the recognition rate in noise-degraded environments. However, which audio and video speech parameters to choose for an optimal system and how they are related is still an open research issue. Here we present a number of statistical analyses that aim at increasing our understanding of such audio-video relationships. In particular, we look at the canonical correlation analysis and the coinertia analysis which investigate the relationship of linear combinations of parameters. The analyses are performed on Australian English as an example.

Download (211k, PDF)

[Homepage] [Research] [Publications]

Statistical Analysis of the Relationship between Audio and Video Speech Parameters for Australian English

Authors: Roland Göcke and J. Bruce Millar Presented by Roland Göcke at the ISCA Tutorial and Research Workshop on Auditory-Visual Speech Processing AVSP 2003, St. Jorioz, France, 4-7 September 2003

Abstract

Authors: Roland Göcke and J. Bruce Millar
Presented by Roland Göcke at the ISCA Tutorial and Research Workshop on Auditory-Visual Speech Processing AVSP 2003, St. Jorioz, France, 4-7 September 2003