Statistical Analysis of the Relationship between Audio and Video Speech Parameters for Australian English
Authors: Roland Göcke and J. Bruce Millar
Presented by Roland Göcke at the ISCA Tutorial and Research
Workshop on Auditory-Visual Speech Processing AVSP 2003, St. Jorioz,
France, 4-7 September 2003
Abstract
After decades of research, automatic speech processing has
become more and more viable in recent years. Audio-video
speech recognition has been shown to improve the recognition
rate in noise-degraded environments. However, which audio
and video speech parameters to choose for an optimal system
and how they are related is still an open research issue. Here we
present a number of statistical analyses that aim at increasing
our understanding of such audio-video relationships. In particular,
we look at the canonical correlation analysis and the
coinertia analysis which investigate the relationship of linear
combinations of parameters. The analyses are performed on
Australian English as an example.
Download (211k, PDF)
[Homepage]
[Research]
[Publications]
(c) Roland Göcke
Last modified: Wed Nov 03 18:11:54 AUS Eastern Daylight Time 2004