AfriSenti - MaRDI portal

Dataset:6038103



OpenML ID: 45282
MaRDI QID: Q6038103

OpenML dataset with id 45282

S. H. Muhammad, I. Abdulmumin, A. A. Ayele, N. Ousidhoum, D. I. Adelani, S. M. Yimam, I. S. Ahmad, et al.

Full work available at URL: https://api.openml.org/data/v1/download/22116250/AfriSenti.arff

Upload date: 16 May 2023



Dataset Characteristics

Number of classes: 3
Number of features: 4 (numeric: 0, symbolic: 2, binary: 0)
Number of instances: 111,720
Number of instances with missing values: 0
Number of missing values: 0
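
The characteristics above can be checked against an ARFF file by counting attribute declarations and data rows. The following is a minimal sketch using only the Python standard library, not an official OpenML or MaRDI tool. The function name `summarize_arff` and the inline `SAMPLE` (including its attribute names and rows) are hypothetical and chosen for illustration only; they are not taken from the actual AfriSenti file. The parser assumes unquoted attribute names without spaces and one instance per line.

```python
import io

# Hypothetical miniature ARFF file; attribute names and rows are invented
# for illustration and do not reproduce the real AfriSenti schema.
SAMPLE = """\
% comment lines start with a percent sign
@relation sample
@attribute text string
@attribute label {positive,neutral,negative}
@attribute lang {aa,bb}
@attribute note string
@data
'good',positive,aa,'x'
'bad',negative,bb,'y'
"""

NUMERIC_TYPES = {"numeric", "real", "integer"}


def summarize_arff(stream):
    """Count attributes by type and data rows in an ARFF text stream.

    Assumes simple attribute names (no quoted names containing spaces)
    and a dense @data section with one instance per line.
    """
    summary = {"numeric": 0, "nominal": 0, "other": 0, "instances": 0}
    in_data = False
    for raw in stream:
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        if in_data:
            summary["instances"] += 1
            continue
        lower = line.lower()
        if lower.startswith("@attribute"):
            # Split into keyword, attribute name, and type specification.
            parts = line.split(None, 2)
            type_spec = parts[2].strip()
            if type_spec.startswith("{"):
                summary["nominal"] += 1  # enumerated (symbolic) values
            elif type_spec.lower() in NUMERIC_TYPES:
                summary["numeric"] += 1
            else:
                summary["other"] += 1  # e.g. string or date
        elif lower.startswith("@data"):
            in_data = True
    return summary


if __name__ == "__main__":
    print(summarize_arff(io.StringIO(SAMPLE)))
```

To apply the same check to the real dataset, one could download the ARFF file from the URL listed on this page and pass an open text file handle to `summarize_arff` in place of the `io.StringIO` wrapper.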

We introduce AfriSenti, which consists of 14 sentiment datasets of 110,000+ tweets in 14 African languages (Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oromo, Swahili, Tigrinya, Twi, Xitsonga, and Yorùbá) from four language families, annotated by native speakers. The data was used in SemEval 2023 Task 12, the first Afro-centric SemEval shared task. We hope AfriSenti enables new work on under-represented languages. The dataset is available at https://github.com/afrisenti-semeval/afrisent-semeval-2023.





