Comparing Variable Selection Techniques Under a High-Dimensional Asymptotic

Arian Maleki – Columbia University

Abstract

In this talk, we discuss the problem of variable selection for linear models under the high-dimensional asymptotic setting, where the number of observations, n, grows at the same rate as the number of predictors, p. We consider two-stage variable selection techniques (TVS) in which the first stage obtains an estimate of the regression coefficients, and the second stage simply thresholds this estimate to select the “important” predictors. The asymptotic false discovery proportion (AFDP) and true positive proportion (ATPP) of these TVS are evaluated, and their optimality will be discussed.

Based on joint work with Haolei Weng and Shuaiwen Wang.

Related Preprints:

https://projecteuclid.org/journals/annals-of-statistics/volume-48/issue-5/Which-bridge-estimator-is-the-best-for-variable-selection/10.1214/19-AOS1906.short

https://arxiv.org/abs/1909.09345