MammoTab 22: a giant and comprehensive dataset for Semantic Table Interpretation
From MaRDI portal
DOI10.5281/zenodo.7014472Zenodo7014472MaRDI QIDQ6702599
Dataset published at Zenodo repository.
Author name not available (Why is that?)
Publication date: 22 August 2022
Copyright license: No records found.
MammoTab is a dataset designed to evaluate semantic table annotation approaches. It includes two types of annotation: cell/mentions to Knowledge Graph (KG) entity matching (CEA task) and; column to KGclass matching (CTA task). It is composed of 980254 tables extracted from 21149260 Wikipedia pages and annotated through Wikidata v. 20220708. The dataset is compliant with the data format used inSemTab2019.
This page was built for dataset: MammoTab 22: a giant and comprehensive dataset for Semantic Table Interpretation