pair0003
OpenML dataset with id 43297
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22102073/pair0003.arff
Upload date: 16 March 2022
Dataset Characteristics
Number of features: 2 (numeric: 2, symbolic: 0 and in total binary: 0 )
Number of instances: 348
Number of instances with missing values: 0
Number of missing values: 0
//Add the description.md of the data file pair0003 Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen, Germany.
======================================================================================================================================
Some pairs are highdimensional, for machine readability the relevant information about this is coded in Meta-data.
Meta-data contains the following information:
number of pair | 1st column of cause | last column of cause | 1st column of effect | last column of effect | dataset weight
The dataset weight should be used for calculating average performance of causal inference methods to avoid a bias introduced by having multiple copies of essentially the same data (for example, the pairs 56-63).
When you use this data set in a publication, please cite the following paper (which also contains much more detailed information regarding this data set in the supplement):
J. M. Mooij, J. Peters, D. Janzing, J. Zscheischler, B. Schoelkopf "Distinguishing cause from effect using observational data: methods and benchmarks" Journal of Machine Learning Research 17(32):1-102, 2016
NOTE: pair0001 - pair0041 are taken from the UCI Machine Learning Repository:
Asuncion, A. & Newman, D.J. (2007). UCI Machine Learning Repository [1]. Irvine, CA: University of California, School of Information and Computer Science.
======================================================================================================================================
Overview over all data pairs.
var 1 var 2 dataset ground truth
pair0001 Altitude Temperature DWD -> pair0002 Altitude Precipitation DWD -> pair0003 Longitude Temperature DWD -> pair0004 Altitude Sunshine hours DWD ->
Information for pairs0003:
DWD data (Deutscher Wetterdienst)
data was taken at 349 stations
x: longitude
y: temperature (averaged over 1961-1990)
ground truth:
x --> y
This page was built for dataset: pair0003