Firstly, a new framework for the design of comprehensive, well-structured, multiple-use AV speech data corpora was proposed and followed in the production of the AVOZES data corpus. Secondly, the first publicly available, comprehensive AV speech data corpus for Australian English (AuE) was produced. In addition, it is the first AV speech data corpus to use a stereo vision system.
AVOZES contains six modules. These are:
AVOZES contains recordings from 20 native speakers of Australian English and 4 non-native speakers. Only the recordings of the native speakers are currently made available. The recordings of the non-native speakers might be published in the future.
Video recordings were made using a calibrated stereo camera system. Video frames are stored as DV-AVI files in the NTSC format (29.97Hz frame rate, 720x480 pixels resolution). Audio recordings were made using a mono microphone. Audio data are stored both in the DV-AVI files as well as in separate WAV files as 48kHz 16bit linear encoded samples.
![]() |
|
The output of the stereo cameras was multiplexed into one video signal using field multiplexing. In this technique, a device containing a video switching integrated circuit selects the signal from one video stream as the odd field of the video output, while the signal from the other video stream becomes the even field. This requires to first de-interleave the odd-even fields of the video frames from each camera. Multiplexing video signals in the analogue phase has the advantage that it can be applied to virtually any video hardware system. Images from two cameras can be stored in a single video frame, albeit at reduced vertical resolutions. Stereo image processing can be performed within the computer's memory using only one image processing board.
If you want to get a copy of AVOZES, you need to acquire a licence. There are two licences: a non-commercial (academic) licence and a commercial licence. The non-commercial licence is available for as little as AUD100 (plus postage)! Basically, just so that I cover my costs, as I am interested in making the data corpus as widely available as possible. Please make sure you checkout the wording of the licence agreement, before ordering a copy of AVOZES.
[Back to Homepage] [Back to Research]