hcdr_main
OpenML dataset with id 45567
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/22116545/hcdr_main.arff
Upload date: 6 June 2023
Dataset Characteristics
Number of classes: 2
Number of features: 121 (numeric: 72, symbolic: 49 and in total binary: 37 )
Number of instances: 307,511
Number of instances with missing values: 298,909
Number of missing values: 9,152,465
Home Credit Default Risk Main Table
WARNING: This is only the main table of the competition' training dataset! Please do not use it alone (but rather use all data available on Kaggle) unless you aim to reproduce the results of:
> Huang, X., Khetan, A., Cvitkovic, M., & Karnin, Z. (2020). > Tabtransformer: Tabular data modeling using contextual embeddings. > arXiv preprint arXiv:2012.06678v1.
Check the Kaggle competition website for further information.
This page was built for dataset: hcdr_main