Diabetes(scikit-learn)
OpenML dataset with id 43123
Bradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani
Full work available at URL: https://api.openml.org/data/v1/download/22101600/Diabetes(scikit-learn).arff
Upload date: 9 December 2021
Dataset Characteristics
Number of classes: 0
Number of features: 11 (numeric: 11, symbolic: 0 and in total binary: 0 )
Number of instances: 442
Number of instances with missing values: 0
Number of missing values: 0
.. _diabetes_dataset:
Diabetes dataset
Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442 diabetes patients, as well as the response of interest, a quantitative measure of disease progression one year after baseline.
Data Set Characteristics:
:Number of Instances: 442
:Number of Attributes: First 10 columns are numeric predictive values
:Target: Column 11 is a quantitative measure of disease progression one year after baseline
:Attribute Information:
- Age
- Sex
- Body mass index
- Average blood pressure
- S1
- S2
- S3
- S4
- S5
- S6
Note: Each of these 10 feature variables have been mean centered and scaled by the standard deviation times `n_samples` (i.e. the sum of squares of each column totals 1).
Source URL: https://www4.stat.ncsu.edu/~boos/var.select/diabetes.html
For more information see:
Bradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani (2004) "Least Angle Regression," Annals of Statistics (with discussion), 407-499.
(https://web.stanford.edu/~hastie/Papers/LARS/LeastAngle_2002.pdf)
This page was built for dataset: Diabetes(scikit-learn)