Sequential quantiles via Hermite series density estimation (Q89436)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Sequential quantiles via Hermite series density estimation
scientific article

    Statements

    11
    0 references
    1
    0 references
    1 January 2017
    0 references
    7 April 2017
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    Sequential quantiles via Hermite series density estimation (English)
    0 references
    Let \(x_1,x_2,\dots,x_n\) be iid random variables with density function \(f(x)\) and distribution function \(F(x)\). The empirical distribution function is defined as \(\widehat{F}(x)=\frac{1}{n}\sum_{i=1}^{n}I\{x_i \leq x\}\), where \(I\) is the usual indicator function. Clearly, \(\widehat{F}(x)\) can also be written in terms of order statistics \(x_{(i)}\), \(i\geq 1\). The kernel distribution function estimator is defined as \(\widehat{F_1}(x)=\frac{1}{n}\sum_{i=1}^{n}\int_{-\infty}^{\infty} \frac{1}{h_n} K\Big(\frac{x_i-x}{h_n}\Big) dx\), where the kernel function \(K\) is a non-negative function integrating to one with mean zero and bandwidth \(h_n >0\) as a smoothing parameter. The quantile function is defined as: \(q(p)=\text{inf}\{x:F(x)\geq p \}\). The \(p\)-th quantile can be estimated from the order statistics. For example, if \(p\in \Big(\frac{i-1}{n},\frac{i}{n}\Big]\), then \(\hat{q}(p)=x_{(i)}\). For certain values of \(p\), \(\hat{q}(p)\) is interpolated between \(x_{(i)}\) and \(x_{(i+1)}\). To improve the efficiency of such estimators, certain weighted average of several order statistics with suitable weights (known as L-estimators) is used as an alternative estimator. Perhaps the most popular class of such estimators uses the kernel as the weight function. Thus the kernel quantile estimator is defined as: \(\hat{q}(p)=\sum_{i=1}^{n} \int_{\frac{i-1}{n}}^{\frac{i}{n}} \frac{1}{h_n}K\Big(\frac{s-p}{h_n}\Big)\, x_{(i)}\, ds\). After providing some background, the authors use Hermite polynomials to develop the Gauss-Hermite expansion of the density function \(f(x)\), which leads to a Gauss-Hermite based estimator of the cumulative distribution function (CDF). They apply these methods to a obtain sequential quantile estimation in the settings of static and dynamic quantile estimation. The treatment of quantile estimation is made concrete by considering cases of independent and identically distributed and non-identically distributed data streams. Observations are continuous random variables observed sequentially one at a time. The quality of the Gauss-Hermite based CDF estimator and the quantile estimation is investigated. The proposed techniques are compared with existing leading algorithms for both simulated data and real data, and suitable conclusions are made.
    0 references
    0 references
    sequential quantile estimation
    0 references
    online quantile estimation
    0 references
    sequential distribution function estimation
    0 references
    online distribution function estimation
    0 references

    Identifiers