HotpotQA_distractor
OpenML dataset with id 45573
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22116557/HotpotQA_distractor.arff
Upload date: 15 June 2023
Dataset Characteristics
Number of features: 7 (numeric: 0, symbolic: 0 and in total binary: 0 )
Number of instances: 97,852
Number of instances with missing values: 0
Number of missing values: 0
HotpotQA is a new dataset with 113k Wikipedia-based question-answer pairs with four key features: (1) the questions require finding and reasoning over multiple supporting documents to answer; (2) the questions are diverse and not constrained to any pre-existing knowledge bases or knowledge schemas; (3) we provide sentence-level supporting facts required for reasoning, allowingQA systems to reason with strong supervision and explain the predictions; (4) we offer a new type of factoid comparison questions to test QA systems' ability to extract relevant facts and perform necessary comparison. The dataset is taken from https://huggingface.co/datasets/hotpot_qa and this upload is the 'distractor' subset.
This page was built for dataset: HotpotQA_distractor