The Audio-Video Australian English Speech Data Corpus AVOZES

Authors: Roland Göcke and J. Bruce Millar
Presented by Roland Göcke at the 8th International Conference on Spoken Language Processing INTERSPEECH 2004 - ICSLP, Jeju, Korea, 4-8 October 2004

Abstract

This paper presents the Audio-Video Australian English Speech Data Corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship between audio and video speech parameters, with an audio-video (AV) automatic speech recognition (ASR) task in mind, but it may also be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its modular design and in its use of a stereo camera system for the video recordings.

Download (99k, PDF)

