Otto-Group-Product-Classification-Challenge
OpenML dataset with id 45548
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22116516/Otto-Group-Product-Classification-Challenge.arff
Upload date: 2 June 2023
Dataset Characteristics
Number of classes: 9
Number of features: 94 (numeric: 93, symbolic: 1 and in total binary: 0 )
Number of instances: 61,878
Number of instances with missing values: 0
Number of missing values: 0
Overview
The Otto Group is one of the world's biggest e-commerce companies, with subsidiaries in more than 20 countries, including Crate & Barrel (USA), Otto.de (Germany) and 3 Suisses (France). We are selling millions of products worldwide every day, with several thousand products being added to our product line.
A consistent analysis of the performance of our products is crucial. However, due to our diverse global infrastructure, many identical products get classified differently. Therefore, the quality of our product analysis depends heavily on the ability to accurately cluster similar products. The better the classification, the more insights we can generate about our product range.
For this competition, we have provided a dataset with 93 features for more than 200,000 products. The objective is to build a predictive model which is able to distinguish between our main product categories. The winning models will be open sourced.
Data description
Each row corresponds to a single product. There are a total of 93 numerical features, which represent counts of different events. All features have been obfuscated and will not be defined any further.
There are nine categories for all products. Each target category represents one of our most important product categories (like fashion, electronics, etc.). The products for the training and testing sets are selected randomly.
Data fields
- id - an anonymous id unique to a product
- feat_1, feat_2, ..., feat_93 - the various features of a product
- target - the class of a product
Notes by Uploader to OpenML
- This is only the training set. The test set is not publicly available.
This page was built for dataset: Otto-Group-Product-Classification-Challenge