Traffic_violations
OpenML dataset with id 42345
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/21801024/Traffic_violations.arff
Upload date: 3 April 2020
Dataset Characteristics
Number of classes: 3
Number of features: 21 (numeric: 1, symbolic: 20; binary in total: 6)
Number of instances: 70,340
Number of instances with missing values: 957
Number of missing values: 2,288
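The counts above can be put in proportion with a few lines of arithmetic. The sketch below uses only the figures quoted in this list. The variable names are illustrative, and the per-cell share assumes that all 21 listed features (including the target) count as columns.

```python
# Figures quoted in the dataset characteristics above.
n_instances = 70_340
n_features = 21
n_rows_with_missing = 957
n_missing_values = 2_288

# Share of instances that contain at least one missing value.
share_rows_affected = n_rows_with_missing / n_instances

# Average number of missing values among the affected instances.
missing_per_affected_row = n_missing_values / n_rows_with_missing

# Assumption: all 21 listed features count as columns of the table.
share_cells_missing = n_missing_values / (n_instances * n_features)

print(f"instances with missing values: {share_rows_affected:.2%}")
print(f"missing values per affected instance: {missing_per_affected_row:.2f}")
print(f"missing cells overall: {share_cells_missing:.3%}")
```

Under these figures, missing values affect only a small share of the instances, so either dropping those rows or imputing them is likely to be workable.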
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or the officer issuing the violation will not be published. For this version, some features were removed and all remaining character features were recoded as nominal factor variables. All punctuation characters were removed from factor levels.
The variable 'Violation.Type' is used as target by default. The smaller target categories 'SERO' and 'ESERO' were collapsed into one category labeled 'SERO'. For this version, the dataset was downsampled to 5% of the original size. Unused factor levels and a few almost constant features were dropped.
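The two preprocessing steps described above, collapsing 'ESERO' into 'SERO' and downsampling to 5%, can be illustrated on toy data. This is a minimal sketch, not the procedure actually used to build the dataset: the source does not say whether the downsampling was stratified, so simple random sampling without replacement is assumed here. The helper names, the toy label counts, and the 'Citation' and 'Warning' labels are placeholders.

```python
import random
from collections import Counter


def collapse_labels(labels, mapping):
    """Replace each label found in `mapping`; leave all others unchanged."""
    return [mapping.get(label, label) for label in labels]


def downsample(rows, fraction, seed=0):
    """Draw a simple random sample (without replacement) of the given fraction."""
    rng = random.Random(seed)
    k = round(len(rows) * fraction)
    return rng.sample(rows, k)


# Toy target column; the counts and the two larger label names are placeholders.
toy_labels = (
    ["Citation"] * 500 + ["Warning"] * 400 + ["SERO"] * 80 + ["ESERO"] * 20
)

# Step 1: merge the smaller category 'ESERO' into 'SERO'.
collapsed = collapse_labels(toy_labels, {"ESERO": "SERO"})

# Step 2: keep 5% of the rows (assumed: simple random sampling).
sample = downsample(collapsed, 0.05, seed=42)

print(Counter(collapsed))
print(len(sample))
```

After the collapse, three distinct target categories remain, matching the number of classes listed in the dataset characteristics.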