Okey Aura Wake-up Word Dataset (Q6700129)
From MaRDI portal
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Okey Aura Wake-up Word Dataset | Dataset published at Zenodo repository. | |
Statements
Speech dataset for wake-up word (WuW) detection in Aura, Telefónica's home assistant. It contains 1247 utterances (1.4 hours) from ~80 speakers. Speakers pronounce the wake-up word itself, "Okey Aura", as well as other sentences that may or may not resemble it. The dataset includes rich metadata annotations, making it possible to study factors and biases that might affect wake-up word detection performance: accent, gender, prosody/emotion, room size, distance to the microphone, etc. It also contains recordings of sentences that are phonetically similar to "Okey Aura", such as "Porque Laura..." or "... como Aura...", for experimenting with difficult cases.
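As a rough illustration of the kind of bias study the metadata is meant to support, the sketch below computes a detector's recall per metadata group. The record layout (`is_wuw`, `detected`, `distance`) and the toy records are hypothetical and invented for this example; they are not the dataset's actual schema or contents.

```python
from collections import defaultdict


def recall_by_group(records, factor):
    """Return {group value: fraction of true wake-up word utterances detected}."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for rec in records:
        if not rec["is_wuw"]:
            continue  # recall only considers utterances that contain the wake-up word
        group = rec[factor]
        totals[group] += 1
        hits[group] += int(rec["detected"])
    return {group: hits[group] / totals[group] for group in totals}


# Toy records invented for illustration; field names are assumptions.
toy_records = [
    {"is_wuw": True, "detected": True, "distance": "near"},
    {"is_wuw": True, "detected": True, "distance": "near"},
    {"is_wuw": True, "detected": False, "distance": "far"},
    {"is_wuw": True, "detected": True, "distance": "far"},
    {"is_wuw": False, "detected": False, "distance": "far"},
]

print(recall_by_group(toy_records, "distance"))
```

The same function could be called with any other annotated factor (for example accent or gender) once the real metadata has been mapped onto records of this shape.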
29 April 2024
1.1.0