Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
pair0005 - MaRDI portal

pair0005

From MaRDI portal
Dataset:6036411



OpenML43299MaRDI QIDQ6036411

OpenML dataset with id 43299

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22102075/pair0005.arff

Upload date: 16 March 2022



Dataset Characteristics

Number of features: 2 (numeric: 2, symbolic: 0 and in total binary: 0 )
Number of instances: 4,176
Number of instances with missing values: 0
Number of missing values: 0

//Add the description.md of the data file pair0005

Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen, Germany.

======================================================================================================================================

Some pairs are highdimensional, for machine readability the relevant information about this is coded in Meta-data.

Meta-data contains the following information:

number of pair | 1st column of cause | last column of cause | 1st column of effect | last column of effect | dataset weight

The dataset weight should be used for calculating average performance of causal inference methods to avoid a bias introduced by having multiple copies of essentially the same data (for example, the pairs 56-63).

When you use this data set in a publication, please cite the following paper (which also contains much more detailed information regarding this data set in the supplement):

J. M. Mooij, J. Peters, D. Janzing, J. Zscheischler, B. Schoelkopf "Distinguishing cause from effect using observational data: methods and benchmarks" Journal of Machine Learning Research 17(32):1-102, 2016

NOTE: pair0001 - pair0041 are taken from the UCI Machine Learning Repository:

Asuncion, A. & Newman, D.J. (2007). UCI Machine Learning Repository [1]. Irvine, CA: University of California, School of Information and Computer Science.

======================================================================================================================================

Overview over all data pairs.

var 1 var 2 dataset ground truth

pair0001 Altitude Temperature DWD -> pair0002 Altitude Precipitation DWD -> pair0003 Longitude Temperature DWD -> pair0004 Altitude Sunshine hours DWD ->

Information for pairs0005:

https://archive.ics.uci.edu/ml/datasets/Abalone

1. Title of Database: Abalone data

2. Sources:

  (a) Original owners of database:

Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania GPO Box 619F, Hobart, Tasmania 7001, Australia (contact: Warwick Nash +61 02 277277, wnash@dpi.tas.gov.au)

  (b) Donor of database:

Sam Waugh (Sam.Waugh@cs.utas.edu.au) Department of Computer Science, University of Tasmania GPO Box 252C, Hobart, Tasmania 7001, Australia

  (c) Date received: December 1995

3. Attribute information:

  Given is the attribute name, attribute type, the measurement unit and a
  brief description.  

Name Data Type Meas. Description ---- --------- ----- ----------- x: Rings integer +1.5 gives the age in years y: Length continuous mm Longest shell measurement


Ground truth: x --> y




This page was built for dataset: pair0005