Disaster-Tweets
OpenML dataset with id 43395
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/22102220/Disaster-Tweets.arff
Upload date: 23 March 2022
Dataset Characteristics
Number of features: 5 (numeric: 2, symbolic: 0, binary: 0)
Number of instances: 11,370
Number of instances with missing values: 3,535
Number of missing values: 3,535
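The two missing-value figures above are equal, which is only possible if every incomplete instance lacks exactly one value. As a minimal sketch of how such counts can be computed, the snippet below tallies them over rows held as dictionaries. The function name `summarize_missing`, the field names, and the sample rows are illustrative assumptions; the rows are synthetic placeholders, not records from the dataset.

```python
from typing import Any, Mapping, Sequence


def summarize_missing(rows: Sequence[Mapping[str, Any]]) -> dict:
    """Count instances, instances with a missing value, and total missing values.

    A value is treated as missing when it is None (an assumption of this sketch).
    """
    rows_with_missing = 0
    total_missing = 0
    for row in rows:
        missing_in_row = sum(1 for value in row.values() if value is None)
        total_missing += missing_in_row
        if missing_in_row:
            rows_with_missing += 1
    return {
        "instances": len(rows),
        "instances_with_missing": rows_with_missing,
        "missing_values": total_missing,
    }


# Synthetic placeholder rows, NOT taken from the dataset; field names are assumed.
sample_rows = [
    {"keyword": "crash", "location": None, "text": "placeholder tweet A"},
    {"keyword": "quarantine", "location": "somewhere", "text": "placeholder tweet B"},
    {"keyword": "bushfires", "location": None, "text": "placeholder tweet C"},
]

summary = summarize_missing(sample_rows)
print(summary)
```

Run against the full file, the same tally would be expected to reproduce the figures listed above, provided missing entries are parsed as None.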
Context
The file contains over 11,000 tweets associated with disaster keywords such as crash, quarantine, and bush fires, together with each tweet's location and the keyword itself. The data structure was inherited from the Disasters on social media dataset. The tweets were collected on January 14, 2020. Some of the topics people were tweeting about:
The eruption of Taal Volcano in Batangas, Philippines
Coronavirus
Bushfires in Australia
Iran's downing of airplane flight PS752
Disclaimer: The dataset contains text that may be considered profane, vulgar, or offensive.
Inspiration
The intention was to enrich the already available data for this topic with newly collected and manually classified tweets.
The initial source is Disasters on social media, which is used in the Real or Not? NLP with Disaster Tweets competition on Kaggle.