Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
covertype - MaRDI portal

covertype

From MaRDI portal
Dataset:6037127



OpenML44081MaRDI QIDQ6037127

OpenML dataset with id 44081

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22103177/covertype.arff

Upload date: 21 June 2022



Dataset Characteristics

Number of classes: 2
Number of features: 11 (numeric: 10, symbolic: 1 and in total binary: 1 )
Number of instances: 566,602
Number of instances with missing values: 0
Number of missing values: 0

Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original description:

Author: Jock A. Blackard, Dr. Denis J. Dean, Dr. Charles W. Anderson Source: LibSVM repository - 2013-11-14 Please cite: For the binarization: R. Collobert, S. Bengio, and Y. Bengio. A parallel mixture of SVMs for very large scale problems. Neural Computation, 14(05):1105-1114, 2002.

This is the famous covertype dataset in its binary version, retrieved 2013-11-13 from the libSVM site (called covtype.binary there). Additional to the preprocessing done there (see LibSVM site for details), this dataset was created as follows: -load covertpype dataset, unscaled. -normalize each file columnwise according to the following rules: -If a column only contains one value (constant feature), it will set to zero and thus removed by sparsity. -If a column contains two values (binary feature), the value occuring more often will be set to zero, the other to one. -If a column contains more than two values (multinary/real feature), the column is divided by its std deviation. -duplicate lines were finally removed.

Preprocessing: Transform from multiclass into binary class.




This page was built for dataset: covertype