The Audio-Video Australian English Speech Data Corpus AVOZES
Authors: Roland Göcke and J. Bruce Millar
Presented by Roland Göcke at the 8th International Conference on
Spoken Language Processing INTERSPEECH 2004 - ICSLP, Jeju, Korea,
4-8 October 2004
Abstract
This paper presents the Audio-Video Australian English Speech data corpus
AVOZES. It contains recordings of 20 speakers uttering a variety of
phrases. The corpus was designed for research on the statistical
relationship of audio and video speech parameters with an audio-video (AV)
automatic speech recognition (ASR) task in mind, but may be useful for
other research tasks. AVOZES is the first published AV speaking-face data
corpus for Australian English and is novel in its use of a stereo camera
system for the video recordings and its modular design.
Download (99k, PDF)
[Homepage]
[Research]
[Publications]
(c) Roland Göcke
Last modified: Wed Nov 03 11:16:46 AUS Eastern Daylight Time 2004