Used-Cars-Dataset
OpenML dataset with id 43305
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22102130/Used-Cars-Dataset.arff
Upload date: 23 March 2022
Dataset Characteristics
Number of features: 25 (numeric: 6, symbolic: 0 and in total binary: 0 )
Number of instances: 426,880
Number of instances with missing values: 426,880
Number of missing values: 1,655,336
Context Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded upon it later to create this dataset which includes every used vehicle entry within the United States on Craigslist.
Content This data is scraped every few months, it contains most all relevant information that Craigslist provides on car sales including columns like price, condition, manufacturer, latitude/longitude, and 18 other categories. For ML projects, consider feature engineering on location columns such as long/lat. For previous listings, check older versions of the dataset.
See https://github.com/AustinReese/UsedVehicleSearch
This page was built for dataset: Used-Cars-Dataset