risk-factors-cervical
OpenML dataset with id 42911
Jaime S. Cardoso, Jessica Fernandes, Kelwin Fernandes
Full work available at URL: https://api.openml.org/data/v1/download/22045542/risk-factors-cervical.arff
Upload date: 19 May 2021
Dataset Characteristics
Number of features: 36 (numeric: 10, symbolic: 0 and in total binary: 0 )
Number of instances: 858
Number of instances with missing values: 799
Number of missing values: 3,622
Author: Kelwin Fernandes, Jaime S. Cardoso, Jessica Fernandes Source: UCI - 2017 Please cite: Paper
Cervical cancer (Risk Factors) Data Set
The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several patients decided not to answer some of the questions because of privacy concerns (missing values).
Attribute information
- (int) Age
- (int) Number of sexual partners
- (int) First sexual intercourse (age)
- (int) Num of pregnancies
- (bool) Smokes
- (bool) Smokes (years)
- (bool) Smokes (packs/year)
- (bool) Hormonal Contraceptives
- (int) Hormonal Contraceptives (years)
- (bool) IUD
- (int) IUD (years)
- (bool) STDs
- (int) STDs (number)
- (bool) STDs:condylomatosis
- (bool) STDs:cervical condylomatosis
- (bool) STDs:vaginal condylomatosis
- (bool) STDs:vulvo-perineal condylomatosis
- (bool) STDs:syphilis
- (bool) STDs:pelvic inflammatory disease
- (bool) STDs:genital herpes
- (bool) STDs:molluscum contagiosum
- (bool) STDs:AIDS
- (bool) STDs:HIV
- (bool) STDs:Hepatitis B
- (bool) STDs:HPV
- (int) STDs: Number of diagnosis
- (int) STDs: Time since first diagnosis
- (int) STDs: Time since last diagnosis
- (bool) Dx:Cancer
- (bool) Dx:CIN
- (bool) Dx:HPV
- (bool) Dx
- (bool) Hinselmann: target variable
- (bool) Schiller: target variable
- (bool) Cytology: target variable
- (bool) Biopsy: target variable
This page was built for dataset: risk-factors-cervical