Variable Selection Using Bayesian Additive Regression Trees

From MaRDI portal
Publication:109608

DOI10.48550/ARXIV.2112.13998arXiv2112.13998MaRDI QIDQ109608

Chuji Luo, Michael J. Daniels

Publication date: 28 December 2021

Abstract: Variable selection is an important statistical problem. This problem becomes more challenging when the candidate predictors are of mixed type (e.g. continuous and binary) and impact the response variable in nonlinear and/or non-additive ways. In this paper, we review existing variable selection approaches for the Bayesian additive regression trees (BART) model, a nonparametric regression model, which is flexible enough to capture the interactions between predictors and nonlinear relationships with the response. An emphasis of this review is on the capability of identifying relevant predictors. We also propose two variable importance measures which can be used in a permutation-based variable selection approach, and a backward variable selection procedure for BART. We present simulations demonstrating that our approaches exhibit improved performance in terms of the ability to recover all the relevant predictors in a variety of data settings, compared to existing BART-based variable selection methods.







Related Items (1)






This page was built for publication: Variable Selection Using Bayesian Additive Regression Trees

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q109608)