The Audio-Video Australian English Speech Data Corpus AVOZES

Authors: Roland Göcke and J. Bruce Millar

Presented by Roland Göcke at the 8th International Conference on Spoken Language Processing INTERSPEECH 2004 - ICSLP, Jeju, Korea, 4-8 October 2004


This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship between audio and video speech parameters, with an audio-video (AV) automatic speech recognition (ASR) task in mind, but it may also be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its modular design and in its use of a stereo camera system for the video recordings.


