audit-data

OpenML dataset with id 42931

Author name not available (Why is that?)

Full work available at URL: https://api.openml.org/data/v1/download/22045580/audit-data.arff

Upload date: 25 May 2021

Dataset Characteristics

Number of classes: 0
Number of features: 37 (numeric: 36, symbolic: 0 and in total binary: 0 )
Number of instances: 1,552
Number of instances with missing values: 1,552
Number of missing values: 19,402

Description

Author: Nishtha Hooda, CSED, TIET, Patiala

Source: UCI - 2018

Please cite: [ https://doi.org/10.1080/08839514.2018.1451032 Hooda, Nishtha, Seema Bawa, and Prashant Singh Rana. 'Fraudulent Firm Classification: A Case Study of an External Audit.' Applied Artificial Intelligence 32.1 (2018): 48-64.]

The goal of the research is to help the auditors by building a classification model that can predict the fraudulent firm on the basis the present and historical risk factors. The information about the sectors and the counts of firms are listed respectively as Irrigation (114), Public Health (77), Buildings and Roads (82), Forest (70), Corporate (47), Animal Husbandry (95), Communication (1), Electrical (4), Land (5), Science and Technology (3), Tourism (1), Fisheries (41), Industries (37), Agriculture (200). The original dataset was separated into a trial and audit dataset. In this dataset these are concatenated into 1 dataset. Two features (trial and audit) have been added to indicate whether the data was originally from the trial or audit dataset.

This page was built for dataset: audit-data