View Submission - CMStatistics

B2000
**Title: **Sequential online subsampling for thinning experimental designs
**Authors: **Luc Pronzato - CNRS - Universite Cote d'Azur (France) **[presenting]**

HaiYing Wang - University of Connecticut (United States)

**Abstract: **In the considered design problem, experimental conditions (design points $X_i$) are presented in the form of a sequence of i.i.d.\ random variables, generated with an unknown probability measure $\mu$, and only a given proportion $\alpha\in(0,1)$ can be accepted. The objective is to select good candidates $X_i$ on the fly and maximise a concave function $\Phi$ of the information matrix. The optimal solution corresponds to the construction of an optimal bounded design measure $\xi_\alpha^*\leq \mu/\alpha$. The difficulty is that $\mu$ is unknown and $\xi_\alpha^*$ must be constructed online. The construction proposed relies on the definition of a threshold $\tau$ on the directional derivative of $\Phi$ at the current information matrix, the value of $\tau$ being fixed by a certain quantile of the distribution of this directional derivative. Combination with recursive quantile estimation yields a nonlinear two-time-scale stochastic approximation method. It can be applied to very long design sequences, since only the current information matrix and estimated quantile need to be stored. Convergence to an optimum design is proved. Various illustrative examples are presented.

HaiYing Wang - University of Connecticut (United States)