Cheng-Yu Sun
Title: Factorial Designs under Baseline Parameterization and Space-filling Designs with Applications to Big Data
Date: June 18, 2021
Time: 1:30 pm (PDT)
Location: Remote delivery
Abstract
This dissertation reports my research work on three topics in the areas of two-level factorial designs under the baseline parameterization, space-filling designs, and subdata selection for big data. When studying two-level factorial designs, factorial effects are usually given by the orthogonal parameterization. But if each factor has an intrinsic baseline level, the baseline parameterization is a more appropriate alternative. We obtain a relationship between these two types of parameterization, and show that certain design properties are invariant. The relationship also allows us to construct an attractive class of robust baseline designs. We then consider two classes of space-filling designs driven by very different considerations: uniform projection designs and strong orthogonal arrays (SOAs), where the former are obtained by minimizing the uniform projection criterion while the latter are a special kind of orthogonal arrays. We express the uniform projection criterion in terms of the stratification characteristics related to an SOA. This new expression is then used to show that certain SOAs are optimal or nearly optimal under the uniform projection criterion. Finally, we consider the problem of selecting a representative subdata from a big data for the purpose of statistical analyses without massive computation. Under the nonparametric regression situation, we present a two-phase selection method, which embodies two important ideas. First, the subdata points should be a space-filling subset within the full data. Second, in the area where the response surface is more rugged, more data points should be selected. Simulations are conducted to demonstrate the usefulness of our method.
Keywords: centered L2-discrepancy; computer experiment; minimum aberration, orthogonal; array; subdata selection