Noise adaptive stream weighting in audio-visual speech recognition (Q1424536)
scientific article; zbMATH DE number 2058711
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Noise adaptive stream weighting in audio-visual speech recognition | scientific article; zbMATH DE number 2058711 | |
Statements
Noise adaptive stream weighting in audio-visual speech recognition (English)
16 March 2004
Summary: It has been shown that integrating acoustic and visual information yields improved speech recognition results, especially in noisy conditions. This raises the question of how to weight the two modalities under different noise conditions. In this paper we develop a weighting process that adapts to various background noise situations. In the presented recognition system, audio and video data are combined following a Separate Integration architecture. A hybrid Artificial Neural Network/Hidden Markov Model system is used for the experiments. The neural networks were in all cases trained on clean data. First, we evaluate the performance of different weighting schemes in a manually controlled recognition task with different types of noise. Next, we compare different criteria for estimating the reliability of the audio stream. Based on this, a mapping between the measurements and the free parameter of the fusion process is derived and its applicability is demonstrated. Finally, the possibilities and limitations of adaptive weighting are compared and discussed.
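The idea of a single free fusion parameter driven by an audio reliability measurement can be sketched as follows. This is a minimal illustration and not the scheme from the paper, which the summary does not specify. It makes three assumptions: the two streams are fused by a log-linear (geometric) weighting of per-class posteriors, audio reliability is measured by the normalized entropy of the audio posteriors, and a sigmoid maps that entropy to the weight. The function names and the `midpoint` and `slope` parameters are hypothetical.

```python
import math


def fuse_posteriors(p_audio, p_video, lam):
    """Log-linear fusion: p[c] is proportional to p_audio[c]**lam * p_video[c]**(1 - lam).

    lam is the single free parameter (assumed here to lie in [0, 1]):
    lam = 1 trusts only the audio stream, lam = 0 only the video stream.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    eps = 1e-12  # floor to avoid log(0)
    log_scores = [
        lam * math.log(max(pa, eps)) + (1.0 - lam) * math.log(max(pv, eps))
        for pa, pv in zip(p_audio, p_video)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(log_scores)
    unnorm = [math.exp(s - m) for s in log_scores]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def normalized_entropy(p):
    """Entropy of a posterior vector divided by log(N), for N >= 2 classes.

    0 means a fully confident distribution, 1 means a uniform one.
    """
    h = -sum(x * math.log(x) for x in p if x > 0.0)
    return h / math.log(len(p))


def entropy_to_weight(h_norm, midpoint=0.5, slope=10.0):
    """Hypothetical sigmoid mapping: low audio entropy gives a weight near 1."""
    return 1.0 / (1.0 + math.exp(slope * (h_norm - midpoint)))


# Example frame: ambiguous audio posteriors, confident video posteriors.
p_audio = [0.40, 0.35, 0.25]
p_video = [0.10, 0.80, 0.10]
lam = entropy_to_weight(normalized_entropy(p_audio))
fused = fuse_posteriors(p_audio, p_video, lam)
```

In this sketch a noisy audio frame produces flat posteriors, hence high entropy and a small weight, so the fused decision leans on the video stream. In practice the mapping would be fitted to measured recognition performance rather than fixed by hand.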
adaptive weighting
robust recognition
multistream recognition
0.8869515
0.8374228