EcoSta 2019: Start Registration
View Submission - EcoSta2019
A0260
Title: Fast and approximate exhaustive variable selection for GLMs with APES Authors:  Kevin Wang - The University of Sydney (Australia) [presenting]
Garth Tarr - University of Sydney (Australia)
Jean Yang - The University of Sydney (Australia)
Samuel Muller - University of Sydney (Australia)
Abstract: Obtaining maximum likelihood estimates for generalised linear models (GLMs) is computationally intensive and remains as the major obstacle for performing exhaustive variable selection. On the other hand, efficient algorithms for exhaustive searches do exist for linear models, most notably the leaps and bound algorithm and, more recently, the mixed integer optimisation algorithm. We present APES (APproximated Exhaustive Search), a new method that approximates all subset selection for a given GLM by reformulating the problem as a linear model. The method works by learning from observational weights in a correct/saturated generalised linear regression model. APES can be used in partnership with any other state- of-the-art linear model selection algorithm, thus enabling (approximate) exhaustive model exploration in dimensions much higher than previously feasible. We will demonstrate that APES model selection is competitive against genuine exhaustive search via simulation studies and applications to health data. The APES method is made available in R through the mplot package.