KDD98
From MaRDI portal
Dataset:6035909
OpenML dataset with id 42343
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/21801022/KDD98.arff
Upload date: 3 April 2020
Dataset Characteristics
Number of classes: 2
Number of features: 478 (numeric: 341, symbolic: 137 and in total binary: 30 )
Number of instances: 82,318
Number of instances with missing values: 82,318
Number of missing values: 2,399,311
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html
The goal is to estimate the return from a direct mailing in order to maximize donation profits.
This dataset represents problem of binary classification - whether there was a response to mailing. For this version, the target was correctly encoded as a binary factor. The features 'HPHONE_D', 'MHUC2', 'INCOME', 'WEALTH1', 'WEALTH2' were recoded as nominal factor variables and the constant feature 'RFA_2R' was removed from the dataset. For this version, the majority class was downsampled to 40% of the original size. Unused factor levels were dropped.
This page was built for dataset: KDD98