Asia_dataset
OpenML dataset with id 43151
Author name not available
Full work available at URL: https://api.openml.org/data/v1/download/22101746/Asia_dataset.arff
Upload date: 31 January 2022
Dataset Characteristics
Number of features: 8 (numeric: 0, symbolic: 8, of which binary: 8)
Number of instances: 5,000
Number of instances with missing values: 0
Number of missing values: 0
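The counts above can be re-derived from the ARFF file itself. The following is a minimal sketch in Python using only the standard library. The helper names `parse_arff` and `characteristics`, the inline `SAMPLE`, the column order, and the `{0,1}` attribute declarations are illustrative assumptions, not taken from the actual file.

```python
import io

# Hypothetical three-row excerpt; the real file's header may differ.
SAMPLE = """\
@relation Asia_dataset
@attribute D {0,1}
@attribute T {0,1}
@attribute L {0,1}
@attribute B {0,1}
@attribute A {0,1}
@attribute S {0,1}
@attribute X {0,1}
@attribute E {0,1}
@data
1,0,0,1,0,1,0,0
0,0,0,0,0,0,0,0
1,0,1,1,0,1,1,1
"""


def parse_arff(text):
    """Parse a simple dense ARFF document into (attribute names, rows).

    Only handles unquoted attribute names without spaces and
    comma-separated data rows, which is enough for this sketch.
    """
    names, rows, in_data = [], [], False
    for raw in io.StringIO(text):
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and ARFF comments
        if in_data:
            rows.append([v.strip().strip("'\"") for v in line.split(",")])
        elif line.lower().startswith("@attribute"):
            # "@attribute NAME TYPE": keep only the name
            names.append(line.split(None, 2)[1].strip("'\""))
        elif line.lower().startswith("@data"):
            in_data = True
    return names, rows


def characteristics(names, rows):
    """Compute the summary counts listed under Dataset Characteristics."""
    n_cols = len(names)
    return {
        "features": n_cols,
        # a feature counts as binary if it takes at most two distinct values
        "binary_features": sum(
            len({r[i] for r in rows} - {"?"}) <= 2 for i in range(n_cols)
        ),
        "instances": len(rows),
        # ARFF marks a missing value with "?"
        "instances_with_missing": sum(any(v == "?" for v in r) for r in rows),
        "missing_values": sum(v == "?" for r in rows for v in r),
    }


names, rows = parse_arff(SAMPLE)
stats = characteristics(names, rows)
print(stats)
```

The text of the full file could be obtained, for example, with `urllib.request.urlopen` on the download URL given above. If the file matches this page, the same function should report 8 features, 5,000 instances, and no missing values.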
Dataset description
A synthetic dataset from Lauritzen and Spiegelhalter (1988) about lung diseases (tuberculosis, lung cancer or bronchitis) and visits to Asia.
Format of the dataset
A data frame with 5,000 rows and 8 binary variables:
D (dyspnoea), binary 1/0 corresponding to "yes" and "no"
T (tuberculosis), binary 1/0 corresponding to "yes" and "no"
L (lung cancer), binary 1/0 corresponding to "yes" and "no"
B (bronchitis), binary 1/0 corresponding to "yes" and "no"
A (visit to Asia), binary 1/0 corresponding to "yes" and "no"
S (smoking), binary 1/0 corresponding to "yes" and "no"
X (chest X-ray), binary 1/0 corresponding to "yes" and "no"
E (tuberculosis versus lung cancer/bronchitis; the "either tuberculosis or lung cancer" node of the original network), binary 1/0 corresponding to "yes" and "no"
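The coding described in the list above can be captured in a small lookup table. This is a minimal Python sketch; `VARIABLES`, `LEVELS`, and `decode_row` are hypothetical names introduced for illustration, and the descriptions are copied from the list as given.

```python
# Column letter -> description, as listed in the format section.
VARIABLES = {
    "D": "dyspnoea",
    "T": "tuberculosis",
    "L": "lung cancer",
    "B": "bronchitis",
    "A": "visit to Asia",
    "S": "smoking",
    "X": "chest X-ray",
    "E": "tuberculosis versus lung cancer/bronchitis",
}

# Binary coding: 1 means "yes", 0 means "no".
LEVELS = {"1": "yes", "0": "no"}


def decode_row(row):
    """Turn {column letter: 1/0} into {description: 'yes'/'no'}.

    Values may be ints or strings; unknown columns raise KeyError.
    """
    unknown = set(row) - set(VARIABLES)
    if unknown:
        raise KeyError(f"unknown columns: {sorted(unknown)}")
    return {VARIABLES[col]: LEVELS[str(val)] for col, val in row.items()}


example = decode_row({"S": 1, "L": "0", "D": 1})
print(example)
```

Accepting both `1` and `"1"` is a convenience choice here, since ARFF nominal values are often read as strings while other loaders may return integers.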
Source: https://www.bnlearn.com/bnrepository/
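The repository cited as the source distributes Asia as a Bayesian network rather than as raw data. The sketch below encodes the directed graph usually given for that network, using this dataset's single-letter column names. The structure is not stated on this page, so treat the `PARENTS` table as an assumption to check against the repository; `edges` is a hypothetical helper.

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Assumed parent sets: node -> set of direct causes.
PARENTS = {
    "A": set(),        # visit to Asia
    "S": set(),        # smoking
    "T": {"A"},        # tuberculosis depends on visit to Asia
    "L": {"S"},        # lung cancer depends on smoking
    "B": {"S"},        # bronchitis depends on smoking
    "E": {"T", "L"},   # combined tuberculosis / lung cancer node
    "X": {"E"},        # chest X-ray result
    "D": {"B", "E"},   # dyspnoea
}


def edges(parents):
    """List the directed edges (parent, child) in sorted order."""
    return sorted((p, c) for c, ps in parents.items() for p in ps)


# A topological order lists every node after all of its parents,
# which is the order in which a sampler would draw the variables.
order = list(TopologicalSorter(PARENTS).static_order())

print(edges(PARENTS))
print(order)
```

A synthetic sample such as this dataset would typically be generated by drawing each variable in such an order, conditional on its parents.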
References
Lauritzen S, Spiegelhalter D (1988). 'Local Computation with Probabilities on Graphical Structures and their Application to Expert Systems (with discussion)'. Journal of the Royal Statistical Society: Series B 50, 157-224.