Okey Aura Wake-up Word Dataset (Q6700129)

Dataset published at Zenodo repository.

Language	Label	Description	Also known as
English	Okey Aura Wake-up Word Dataset	Dataset published at Zenodo repository.

Statements

instance of

data set

0 references

description

Speech dataset for wake-up word (WuW) detection in Telefónica's home assistant, Aura. It contains 1247 utterances (1.4 hours) from ~80 speakers. Speakers pronounce the wake-up word itself, "Okey Aura", as well as other sentences that may or may not resemble it. The dataset includes rich metadata annotations, making it possible to study factors and biases that might affect wake-up word detection performance: accent, gender, prosody/emotion, room size, distance to the microphone, etc. It also contains recordings of sentences that are phonetically similar to "Okey Aura", such as "Porque Laura..." or "... como Aura...", for experimenting with difficult cases.

0 references
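
The description states that the metadata annotations allow studying how factors such as gender or accent affect detection performance. A minimal sketch of that kind of analysis follows, assuming each utterance can be reduced to a (group, is wake-up word, detector fired) triple. The records, the field layout, and the function name rates_by_group are hypothetical illustrations and are not taken from the dataset or its documentation.

```python
from collections import defaultdict

# Hypothetical records: (group value, is_wake_word, detector_fired).
# These rows are illustrative only and are NOT taken from the dataset.
records = [
    ("female", True, True),
    ("female", True, False),
    ("female", False, False),
    ("male", True, True),
    ("male", False, True),   # e.g. a confusable phrase triggering the detector
    ("male", False, False),
]


def rates_by_group(rows):
    """Return {group: (false_reject_rate, false_accept_rate)}."""
    counts = defaultdict(lambda: {"pos": 0, "miss": 0, "neg": 0, "fa": 0})
    for group, is_wuw, fired in rows:
        c = counts[group]
        if is_wuw:
            c["pos"] += 1
            c["miss"] += int(not fired)   # wake-up word present but not detected
        else:
            c["neg"] += 1
            c["fa"] += int(fired)         # detector fired on a non-wake-up sentence
    result = {}
    for group, c in counts.items():
        # A rate is None when the group has no utterances of that kind.
        frr = c["miss"] / c["pos"] if c["pos"] else None
        far = c["fa"] / c["neg"] if c["neg"] else None
        result[group] = (frr, far)
    return result


rates = rates_by_group(records)
for group, (frr, far) in sorted(rates.items()):
    print(group, "false reject rate:", frr, "false accept rate:", far)
```

The same grouping could be applied to any other annotated factor (accent, room size, distance to the microphone) by changing the first element of each triple.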

publication date

29 April 2024

0 references
