Goodreads-Books---31-Features
OpenML dataset with id 43438
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22102263/Goodreads-Books---31-Features.arff
Upload date: 23 March 2022
Dataset Characteristics
Number of features: 31 (numeric: 10, symbolic: 0 and in total binary: 0 )
Number of instances: 52,199
Number of instances with missing values: 52,199
Number of missing values: 285,951
Context
The official Goodread's API limits retrievable data, so I decided to scrape the actual HTTP pages and grab additional details on each book.
Content
Books are scraped from a list titles the "Best Books Ever" which can be found here https://www.goodreads.com/list/show/1.Best_Books_Ever
Acknowledgements
Thanks to Goodreads for housing the data.
This page was built for dataset: Goodreads-Books---31-Features