langLog
OpenML dataset with id 40593
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/4644186/langLog.arff
Upload date: 16 February 2017
Dataset Characteristics
Number of classes: 2
Number of features: 1,079 (numeric: 1,004, symbolic: 75 and in total binary: 75 )
Number of instances: 1,460
Number of instances with missing values: 0
Number of missing values: 0
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctorial thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such as language, politics, errors, humor and computational linguistics. Note that the data on OpenML uses modified names for taget labels which were longer than 18 characters.
This page was built for dataset: langLog