youtube-spam-shakira
OpenML dataset with id 42908
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22045539/youtube-spam-shakira.arff
Upload date: 19 May 2021
Dataset Characteristics
Number of features: 5 (numeric: 1, symbolic: 0 and in total binary: 0 )
Number of instances: 370
Number of instances with missing values: 0
Number of missing values: 0
Author: Unknown Source: UCI - 2017 Please cite*: Paper
YouTube Spam Collection Shakira dataset
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period. This dataset only contains information about Shakira. It consists of 174 spam entries and 196 ham entries, leading to a grand total of 370samples.
Attribute information
The collection is composed by one CSV file per dataset, where each line has the following attributes:
COMMENT_ID,AUTHOR,DATE,CONTENT,TAG
This page was built for dataset: youtube-spam-shakira