Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Cosmetics-datasets - MaRDI portal

Cosmetics-datasets

From MaRDI portal
Dataset:6036580



OpenML43481MaRDI QIDQ6036580

OpenML dataset with id 43481

Author name not available (Why is that?)

Full work available at URL: https://api.openml.org/data/v1/download/22102306/Cosmetics-datasets.arff

Upload date: 23 March 2022



Dataset Characteristics

Number of features: 11 (numeric: 7, symbolic: 0 and in total binary: 0 )
Number of instances: 1,472
Number of instances with missing values: 0
Number of missing values: 0

Context Whenever I want to try a new cosmetic item, it's so difficult to choose. It's actually more than difficult. It's sometimes scary because new items that I've never tried end up giving me skin trouble. We know the information we need is on the back of each product, but it's really hard to interpret those ingredient lists unless you're a chemist. You may be able to relate to this situation.

Content we are going to create a content-based recommendation system where the 'content' will be the chemical components of cosmetics. Specifically, we will process ingredient lists for 1472 cosmetics on Sephora via word embedding, then visualize ingredient similarity using a machine learning method called t-SNE and an interactive visualization library called Bokeh. Let's inspect our data first.

Acknowledgements DataCamp






This page was built for dataset: Cosmetics-datasets