OK Aura Wake-up Word Dataset (Q6700106)
From MaRDI portal
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | OK Aura Wake-up Word Dataset | Dataset published at Zenodo repository. | |
Statements
Speech dataset for wake-up word (WuW) detection in Telefónica's home assistant, Aura. It contains 1247 utterances (1.4 hours) from ~80 speakers. Speakers pronounce the wake-up word itself, "OK Aura", plus other sentences that may or may not resemble it. The dataset includes rich metadata annotations, making it possible to study diverse factors and biases that might affect wake-up word detection performance: accent, gender, prosody/emotion, room size, distance to the microphone, etc. It also contains recordings of sentences that are phonetically similar to "OK Aura", such as "Porque Laura..." or "... como Aura...", intended for experiments with difficult sentences.
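As a rough illustration of the kind of bias study the metadata allows, the sketch below computes a detection rate per value of one annotated factor. The record fields (`is_wuw`, `detected`, `distance`) and the toy records are hypothetical placeholders, not the dataset's actual schema or results; the real annotation names should be taken from the files in the Zenodo record.

```python
from collections import defaultdict


def detection_rate_by_factor(records, factor):
    """Return {factor value: fraction of wake-up word utterances detected}.

    `records` is a list of dicts. The keys used here ("is_wuw", "detected",
    and the chosen factor) are hypothetical names for illustration only.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for rec in records:
        if not rec["is_wuw"]:
            continue  # only true wake-up word utterances count toward the rate
        key = rec[factor]
        totals[key] += 1
        hits[key] += int(rec["detected"])
    return {key: hits[key] / totals[key] for key in totals}


# Made-up toy records, not taken from the dataset.
records = [
    {"is_wuw": True, "detected": True, "distance": "close"},
    {"is_wuw": True, "detected": False, "distance": "far"},
    {"is_wuw": True, "detected": True, "distance": "far"},
    {"is_wuw": False, "detected": False, "distance": "close"},
]

rates = detection_rate_by_factor(records, "distance")
print(rates)
```

The same function could be called with any other annotated factor (for example accent or room size) once the records are loaded with the dataset's real field names.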
29 November 2021
1.0.0