vtreat (Q27746)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: vtreat |
A Statistically Sound 'data.frame' Processor/Conditioner
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | vtreat |
A Statistically Sound 'data.frame' Processor/Conditioner |
Statements
19 August 2023
0 references
A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). Reference: "'vtreat': a data.frame Processor for Predictive Modeling", Zumel, Mount, 2016, <doi:10.5281/zenodo.1173313>.
0 references
Identifiers
1 September 2023
0 references