Code_Smells_Blob
OpenML dataset with id 43078
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/22045959/Code_Smells_Blob.arff
Upload date: 10 August 2021
Dataset Characteristics
Number of classes: 0
Number of features: 67 (numeric: 67, symbolic: 0, binary: 0)
Number of instances: 83,943
Number of instances with missing values: 83,943
Number of missing values: 2,801,627
This dataset combines records from the MLCQ dataset with metrics extracted using the PMD tool and the Understand tool, in order to determine whether a file contains code smells. Note that the records are at the (sub)class level. This is a classification task: the default target (severity) should be binarized with a static threshold (preferably between 0.5 and 2.5). Please read the publication carefully to understand how to use this dataset.
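The binarization step described above could be sketched as follows. This is a minimal illustration, not the method from the publication: the function name `binarize_severity`, the default threshold of 1.5 (simply a value inside the suggested 0.5 to 2.5 range), the choice to treat values strictly above the threshold as the positive class, and the handling of missing severities as `None` are all assumptions.

```python
import math
from typing import Iterable, List, Optional


def binarize_severity(
    values: Iterable[Optional[float]],
    threshold: float = 1.5,  # assumed default; the description suggests 0.5 to 2.5
) -> List[Optional[int]]:
    """Map each severity to 1 (strictly above threshold) or 0 (otherwise).

    Missing severities (None or NaN) are passed through as None so the
    caller can decide whether to drop or impute them (an assumption here).
    """
    labels: List[Optional[int]] = []
    for value in values:
        if value is None or (isinstance(value, float) and math.isnan(value)):
            labels.append(None)
        else:
            labels.append(1 if value > threshold else 0)
    return labels


# Hypothetical severity values, not taken from the dataset.
example = [0.0, 1.0, 2.0, 3.0, float("nan")]
print(binarize_severity(example))  # [0, 0, 1, 1, None]
```

Because the threshold is static, the same value should be applied to every split of the data; which value is appropriate, and how missing severities should be treated, is best decided after consulting the publication.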