AVOZES

The Audio-Video Australian English Speech Data Corpus


Contents Module 5 - Application-driven sequences - Digits 0-9 in a carrier phrase

The sequences in this module can be used as examples of applying any results, gained from an analysis of the phonemes and visemes in the "short words" module, to short sequences that are more application-driven. Digit recognition is a common task in automatic speech recognition and similar sequences can be found in a number of AV speech corpora, for example in DAVID and Tulips1.

The AVOZES data corpus includes one sequence per digit for each speaker, spoken in order from 0 to 9. Again, each digit is enclosed by the carrier phrase "You grab /DIGIT/ beer." to ensure lip closure before and after the digit for ease of segmentation of the video stream. These sequences are typically 2-3s long.

Example Sequence
Note: Any example sequence is provided for informative purposes, so that you can judge whether AVOZES is the right data corpus for you. You may use it for internal evaluation purposes only. For all other uses, including academic research, a licence must be acquired (non-commercial (academic) licence, commercial licence).


Download an example sequence (6.9MB, AVI) of "You grab ONE beer."

[Homepage] [AVOZES Homepage] [Research]


© Roland Göcke
Last modified: Tue Nov 09 17:26:22 AUS Eastern Daylight Time 2004