Discrete and dual tree wavelet features for real-time speech/music discrimination (Q542676)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Discrete and dual tree wavelet features for real-time speech/music discrimination |
scientific article; zbMATH DE number 5907346
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Discrete and dual tree wavelet features for real-time speech/music discrimination |
scientific article; zbMATH DE number 5907346 |
Statements
Discrete and dual tree wavelet features for real-time speech/music discrimination (English)
0 references
14 June 2011
0 references
Summary: The performance of wavelet transform-based features for the speech/music discrimination task has been investigated. In order to extract wavelet domain features, discrete and complex orthogonal wavelet transforms have been used. The performance of the proposed feature set has been compared with a feature set constructed from the most common time, frequency and cepstral domain features such as number of zero crossings, spectral centroid, spectral flux, and Mel cepstral coefficients. The artificial neural networks have been used as classification tool. The principal component analysis has been applied to eliminate the correlated features before the classification stage. For discrete wavelet transform, considering the number of vanishing moments and orthogonality, the best performance is obtained with Daubechies~8 wavelet among the other members of the Daubechies family. The dual tree wavelet transform has also demonstrated a successful performance both in terms of accuracy and time consumption. Finally, a real-time discrimination system has been implemented using the Daubechies~8 wavelet which has the best accuracy.
0 references
discrimination
0 references
discrete wavelet transform
0 references
db8 wavelet
0 references