An algorithm for discretization of real value attributes based on interval similarity (Q2375510)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: An algorithm for discretization of real value attributes based on interval similarity |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | An algorithm for discretization of real value attributes based on interval similarity |
scientific article |
Statements
An algorithm for discretization of real value attributes based on interval similarity (English)
0 references
14 June 2013
0 references
Summary: Discretization algorithms for real value attributes are of very important uses in many areas such as intelligence and machine learning. The algorithms related to Chi2 algorithm (includes modified Chi2 algorithm and extended Chi2 algorithm) are famous discretization algorithms exploiting the technique of probability and statistics. In this paper, the algorithms are analyzed, and their drawback is pointed. Based on the analysis a new modified algorithm based on interval similarity is proposed. The new algorithm defines an interval similarity function which is regarded as a new merging standard in the process of discretization. At the same time, two important parameters (condition parameter \(\alpha\) and tiny move parameter \(c\)) in the process of discretization and discrepancy extent of a number of adjacent two intervals are given in the form of function. The related theory analysis and the experiment results show that the presented algorithm is effective.
0 references
numerical examples
0 references
real value attributes
0 references
Chi2 algorithm
0 references
probability
0 references
statistics
0 references
interval similarity
0 references