Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
fri_c3_100_5 - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of MediaWiki\Skin\BaseTemplate::getPersonalTools was deprecated in 1.46 Call $this->getSkin()->getPersonalToolsForMakeListItem instead (T422975). [Called from Skins\Chameleon\Components\NavbarHorizontal\PersonalTools::getHtml in /var/www/html/w/skins/chameleon/src/Components/NavbarHorizontal/PersonalTools.php at line 66] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

fri_c3_100_5

From MaRDI portal
Dataset:6033346



OpenML611MaRDI QIDQ6033346

OpenML dataset with id 611

Author name not available (Why is that?)

Full work available at URL: https://api.openml.org/data/v1/download/1390116/fri_c3_100_5.arff

Upload date: 4 October 2014



Dataset Characteristics

Number of classes: 0
Number of features: 6 (numeric: 6, symbolic: 0 and in total binary: 0 )
Number of instances: 100
Number of instances with missing values: 0
Number of missing values: 0

Author: Source: Unknown - Date unknown Please cite:

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting

The dataset names are coded as "fri_colinearintydegree_samplenumber_featurenumber".

Friedman is the one of the most used functions for data generation (Friedman, 1999). Friedman functions include both linear and non-linear relations between input and output, and a normalized noise (e) is added to the output. The Friedman function is as follows:

y=10*sin(pi*x1*x2)+20*(x3-0.5)^2=10*X4+5*X5+e

In the original Friedman function, there are 5 features for input. To measure the effects of non-related features, additional features are added to the datasets. These added features are independent from the output. However, to measure the algorithm's robustness to the colinearity, the datasets are generated with 5 different colinearity degrees. The colinearity degrees is the number of features depending on other features.

The generated Friedman dataset's parameters and values are given below: The number of features: 5 10 25 50 100 (only the first 5 features are related to the output. The rest are completely random) The number of samples: 100 250 500 1000 Colinearity degrees: 0 1 2 3 4 For the datasets with colinearity degree 4, the numbers of features are generated as 10, 25, 50 and 100. The other datasets have 5, 10, 25 and 50 features.

As a result, 80 artificial datasets are generated by (4 different feature number * 4 different sample number * 5 different colinearity degree)

The last attribute in each file is the target.






This page was built for dataset: fri_c3_100_5