scpf - MaRDI portal

scpf

From MaRDI portal
Dataset:6035522



OpenML ID: 41555, MaRDI QID: Q6035522

OpenML dataset with id 41555

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/21241899/scpf.arff

Upload date: 3 April 2019



Dataset Characteristics

Number of features: 26 (numeric: 26, symbolic: 0, binary: 0)
Number of instances: 1,137
Number of instances with missing values: 994
Number of missing values: 9,255

Multivariate regression data set from https://link.springer.com/article/10.1007%2Fs10994-016-5546-z. This is a pre-processed version of the dataset used in Kaggle's See Click Predict Fix competition (Kaggle 2013). It concerns the prediction of three target variables that represent the number of views, clicks and comments that a specific 311 issue will receive. The issues were collected from 4 US cities (Oakland, Richmond, New Haven, Chicago) and span a period of 12 months (01/2012 to 12/2012). The version of the dataset used here is a random 1 percent sample of the data. The features are the number of days that an issue stayed online, the source from which the issue was created (e.g. android, iphone, remote api), the type of the issue (e.g. graffiti, pothole, trash), the geographical coordinates of the issue, the city it was published from and the distance from the city center. All multi-valued nominal variables were first transformed to binary, and then rare binary variables (true for less than 1 percent of the cases) were removed.
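
The pre-processing described above (nominal variables expanded to binary indicators, then indicators true for less than 1 percent of the cases removed) can be sketched roughly as follows. This is a minimal illustration only, not the original authors' code: the function name `binarize_and_prune`, the column names `days_online` and `source`, and the toy rows are all hypothetical and are not taken from the dataset file.

```python
from collections import Counter


def binarize_and_prune(rows, nominal_columns, min_fraction=0.01):
    """One-hot encode nominal columns, then drop rare indicator columns.

    rows: list of dicts mapping column name -> value (hypothetical layout).
    nominal_columns: names of the columns to expand into 0/1 indicators.
    min_fraction: indicators true for a smaller share of rows are dropped.
    """
    n = len(rows)
    # Count how often each (column, value) pair occurs across all rows.
    counts = Counter((col, row[col]) for row in rows for col in nominal_columns)
    # Keep only indicators that are true for at least min_fraction of the rows.
    kept = sorted(pair for pair, c in counts.items() if c / n >= min_fraction)

    encoded = []
    for row in rows:
        # Copy the non-nominal columns unchanged.
        out = {k: v for k, v in row.items() if k not in nominal_columns}
        # Add one 0/1 indicator per surviving (column, value) pair.
        for col, value in kept:
            out[f"{col}={value}"] = 1 if row[col] == value else 0
        encoded.append(out)
    return encoded


# Toy data (made up): "remote_api" occurs in 1 of 200 rows (0.5 percent),
# so its indicator falls below the 1 percent threshold and is removed.
rows = (
    [{"days_online": 3, "source": "android"}] * 150
    + [{"days_online": 5, "source": "iphone"}] * 49
    + [{"days_online": 1, "source": "remote_api"}]
)
encoded = binarize_and_prune(rows, ["source"])
print(sorted(encoded[0]))
# ['days_online', 'source=android', 'source=iphone']
```

Note that a row whose only category was pruned ends up with all indicators set to 0, which is one plausible reading of "rare binary variables were removed"; the source does not say how such rows were handled.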



