Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Tweets-with-keyword-lockdown-in-April-July-2020 - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of MediaWiki\Skin\BaseTemplate::getPersonalTools was deprecated in 1.46 Call $this->getSkin()->getPersonalToolsForMakeListItem instead (T422975). [Called from Skins\Chameleon\Components\NavbarHorizontal\PersonalTools::getHtml in /var/www/html/w/skins/chameleon/src/Components/NavbarHorizontal/PersonalTools.php at line 66] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Tweets-with-keyword-lockdown-in-April-July-2020

From MaRDI portal
Dataset:6036887



OpenML43794MaRDI QIDQ6036887

OpenML dataset with id 43794

Author name not available (Why is that?)

Full work available at URL: https://api.openml.org/data/v1/download/22102619/Tweets-with-keyword-lockdown-in-April-July-2020.arff

Upload date: 24 March 2022



Dataset Characteristics

Number of features: 7 (numeric: 3, symbolic: 0 and in total binary: 0 )
Number of instances: 95,488
Number of instances with missing values: 90,899
Number of missing values: 160,244

Context This data was collected to be used with an academic project of mine. The project was about sentiment analysis of tweets during lockdown. Content I used the GetOldTweets3 (https://pypi.org/project/GetOldTweets3/) python3 library to pull the tweets off Twitter. The tweets range between 1 April 2020 to 1 August 2020, which was the peak lockdown period in India. Tweets with duplicate text and NaN values and that was the only cleaning I did on the data. Total rows of tweets: 95488 Columns:

Index (be sure to use df = pandas.read_csv("tweets_lockdown.csv", index_col=0)) Text - The text of the tweet Date - Date and time of tweet in datetime format Retweets - Number of retweets for the tweet Favorites - Favorites on the tweet Mentions - Usernames mentioned in the tweets in format HashTags - Hashtags present in the tweet in format

"Top Tweets" attribute was turned off while scraping. Inspiration Twitter data gives us a lot of scope for data cleaning, text preprocessing, association rule mining, sentiment analysis and so on.






This page was built for dataset: Tweets-with-keyword-lockdown-in-April-July-2020