COMPSTAT 2016: Start Registration
View Submission - COMPSTAT
A0407
Title: A toolkit for stability assessment of tree-based learners Authors:  Michel Philipp - University of Zurich (Switzerland) [presenting]
Achim Zeileis - Universitaet Innsbruck (Austria)
Carolin Strobl - University of Zurich (Switzerland)
Abstract: Recursive partitioning techniques are established and frequently applied for exploring unknown structures in complex and possibly high-dimensional data sets. The methods can be used to detect interactions and nonlinear structures in a data-driven way by recursively splitting the predictor space to form homogeneous groups of observations. However, while the resulting trees are easy to interpret, they are also known to be potentially unstable. Altering the data slightly can change either the variables and/or the cutpoints selected for splitting. Moreover, the methods do not provide measures of confidence for the selected splits and therefore users cannot assess the uncertainty of a given fitted tree. We present a toolkit of descriptive measures and graphical illustrations based on resampling, that can be used to assess the stability of the variable and cutpoint selection in recursive partitioning. The summary measures and graphics available in the toolkit are illustrated using a real world data set and implemented in the \textsf{R}~package \textbf{stablelearner}.