Robust Test Statistics Based on Restricted

1. Introduction

Let ${(X, β_{X}, P_{θ})}_{θ \in Θ}$ be the statistical space associated with the random variable $X,$ where $β_{X}$ is the $σ$-field of Borel subsets $A \subset X$ and ${\{P_{θ}\}}_{θ \in Θ}$ is a family of probability distributions defined on the measurable space $(X, β_{X}),$ with $Θ$ an open subset of $R^{p}$ and $p \geq 1 .$ We assume that the probability measures $P_{θ}$ are described by densities $f_{θ} (x) = d P_{θ} / d μ (x),$ where $μ$ is a $σ$-finite measure on $(X, β_{X}) .$ Given a random sample $X_{1}, \dots, X_{n}$ of the random variable X with density belonging to the parametric family, the most popular estimator of the model parameter $θ$ is the maximum likelihood estimator (MLE), which maximizes the likelihood function of the assumed model. The MLE has been widely studied in the literature for general statistical models, and it has been shown that, under certain regularity conditions, the sequence of MLEs of $θ,$ ${\hat{θ}}_{n},$ is consistent, asymptotically normal, and asymptotically efficient. That is, the MLE is a BAN (best asymptotically normal) estimator. However, in many popular statistical models, the MLE is markedly non-robust against deviations, even very small ones, from the assumed parametric model.

To overcome the lack of robustness, minimum distance (or minimum divergence) estimators (MDEs) have been developed. MDEs have received growing attention in statistical inference because of their ability to reconcile efficiency and robustness. In parametric estimation, the role of divergence or distance measures is very intuitive: the estimates of the unknown parameters are obtained by minimizing a suitable divergence measure between the distribution estimated from the data and the assumed model distribution. A growing body of literature recognizes that MDEs gain robustness with respect to the MLE without a significant loss of efficiency. See, for instance, the works of Beran [1], Tamura and Boes [2], Simpson [3,4], Lindsay [5], Pardo [6], and Basu et al. [7] and the references therein.

Let G denote the unknown distribution function, with associated density $g,$ underlying the data. The minimum divergence (distance) functional evaluated at G, $T (G)$ , is defined as

(1) $d (g, f_{T (G)}) = min_{θ \in Θ} d (g, f_{θ}),$

with $d (g, f_{θ})$ being a distance or divergence measure between the densities g and $f_{θ} .$ As the true distribution underlying the data is unknown, given a random sample, we could estimate the model parameter $θ$, substituting in the previous expression the true distribution G by its empirical estimation $G_{n} .$ Therefore, the MDE of $θ$ is given by

(2) ${\hat{θ}}_{n} = T (G_{n}) .$

When dealing with continuous models, it is convenient to consider families of divergence measures for which non-parametric estimators of the unknown density function are not needed. From this perspective, the density power divergence (DPD) family, leading to the minimum density power divergence estimators (MDPDEs) (see Basu et al. [7]), as well as the Rényi’s pseudodistance (RP), leading to the minimum Rényi’s pseudodistance estimators (MRPEs) (see Broniatowski et al. [8]), among others, play an important role. The results presented in Broniatowski et al. [8] in the context of independent and identically distributed random variables were extended to the case of independent but not identically distributed random variables by Castilla et al. [9].

In many situations we have additional knowledge about the true parameter value, as it must satisfy certain constraints. Then, the restricted parameter space has the form

(3) $\{θ \in Θ / g (θ) = 0_{r}\},$

where $0_{r}$ denotes the null vector of dimension r, and $g : R^{p} \to R^{r}$ is a vector-valued function such that the $p \times r$ matrix

(4) $G (θ) = \frac{\partial g^{T} (θ)}{\partial θ}$

exists and is continuous in $θ,$ and rank $(G (θ)) = r .$ Here, superscript T represents the transpose of the matrix. In the following, the restricted parameter space given in (3) is denoted by $Θ_{0},$ as in most situations, it will represent a composite null hypothesis.

The most popular estimator of $θ$ under the non-linear constraint given in (3) is the restricted MLE (RMLE) that maximizes the likelihood function subject to the constraint $g (θ) = 0_{r}$ (see Silvey [10]). The RMLE encounters similar robustness problems to the MLE. To overcome such deficiency, the restricted MDPDEs (RMDPDEs) were introduced in Basu et al. [11] and their theoretical robustness properties were later studied in Ghosh [12].

The main purpose of this paper is to extend the theory developed for the MRPE to the restricted parameter space setting, yielding the restricted MRPE (RMRPE), where the parameter space has the form (3). The rest of the paper is organized as follows: In Section 2, the MRPE is introduced. Section 3 presents the RMRPE, and its asymptotic distribution as well as its influence function are obtained. In Section 4, two different test statistics for testing composite null hypotheses, based on the RMRPE, are developed, and explicit expressions of the statistics are presented for testing in normal populations. Section 5 presents a simulation study, where the robustness of the proposed estimators and test statistics is empirically shown. Section 6 deals with real-data situations. Finally, some conclusions are presented in Section 7.

2. Minimum Rényi Pseudodistance Estimators

In this section, we introduce the MRPE. We derive the estimating equations of the MRPE and recall its asymptotic distribution.

Let $X_{1}, \dots, X_{n}$ be a random sample of size n from a population having true and unknown density function $g,$ modeled by a parametric family of densities $f_{θ}$ with $θ \in Θ \subset R^{p} .$ The RP between the densities $f_{θ}$ and g is given, for $τ > 0,$ by

$\begin{matrix} R_{τ} (f_{θ}, g) = & \frac{1}{τ + 1} log (\int f_{θ} {(x)}^{τ + 1} d x) + \frac{1}{τ (τ + 1)} log (\int g {(x)}^{τ + 1} d x) \\ & - \frac{1}{τ} log (\int f_{θ} {(x)}^{τ} g (x) d x) . \end{matrix}$

The RP can be defined for $τ = 0$ taking continuous limits, yielding the expression

$R_{0} (f_{θ}, g) = lim_{τ ↓ 0} R_{τ} (f_{θ}, g) = \int g (x) log \frac{g (x)}{f_{θ} (x)} d x .$

Then, the RP coincides with the Kullback–Leibler divergence (KL) between g and $f_{θ}$ , at $τ = 0$ (see Pardo, 2006).
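As a numerical illustration of the definition above, the following minimal sketch evaluates $R_{τ} (f_{θ}, g)$ by a plain trapezoidal rule and compares a small-$τ$ value with the Kullback–Leibler limit. The helper names (`normal_pdf`, `integrate`, `renyi_pseudodistance`, `kullback_leibler`), the integration range, and the choice of two normal densities are assumptions made for this example only; they are not part of the original text.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2); sigma is the standard deviation."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def integrate(h, lo=-30.0, hi=30.0, n=30000):
    """Composite trapezoidal rule for h on [lo, hi] (range is an assumption)."""
    step = (hi - lo) / n
    total = 0.5 * (h(lo) + h(hi))
    for k in range(1, n):
        total += h(lo + k * step)
    return total * step


def renyi_pseudodistance(f, g, tau):
    """R_tau(f, g) for tau > 0, with f the model density and g the data density."""
    a = math.log(integrate(lambda x: f(x) ** (tau + 1.0)))
    b = math.log(integrate(lambda x: g(x) ** (tau + 1.0)))
    c = math.log(integrate(lambda x: f(x) ** tau * g(x)))
    return a / (tau + 1.0) + b / (tau * (tau + 1.0)) - c / tau


def kullback_leibler(f, g):
    """R_0(f, g): integral of g(x) log(g(x) / f(x))."""
    def integrand(x):
        gx, fx = g(x), f(x)
        return gx * math.log(gx / fx) if gx > 0.0 and fx > 0.0 else 0.0
    return integrate(integrand)


if __name__ == "__main__":
    f = lambda x: normal_pdf(x, 0.0, 1.0)   # model density
    g = lambda x: normal_pdf(x, 1.0, 1.5)   # data density
    for tau in (1e-3, 0.2, 0.5, 1.0):
        print(tau, renyi_pseudodistance(f, g, tau))
    print("KL limit:", kullback_leibler(f, g))
```

The value at $τ = 10^{- 3}$ should be close to the Kullback–Leibler value, and the pseudodistance of a density to itself should vanish up to rounding error.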

The RP was considered for the first time by Jones et al. [13]. Later Broniatowski et al. [8] established some useful properties of the divergence, such as the positivity of the RP for any two densities and for all values of the parameter $τ,$ $R_{τ} (f_{θ}, g) \geq 0$ and uniqueness of the minimum RP within a parametric family, that is, $R_{τ} (f_{θ}, g) = 0$ if and only if $f_{θ} = g .$ The last property justifies the definition of the MRPEs as the minimizer of the RP between the assumed distribution and the empirical distribution of the data. It is interesting to note that the so-called RP by Broniatowski et al. [8] had been previously considered by Fujisawa and Eguchi [14] under the name of $γ$ -cross entropy. In that paper, some appealing robustness properties of the estimators based on such entropy are shown.

Given a sample $X_{1}, \dots, X_{n}$ , from Broniatowski et al. [8] it can be seen that minimizing $R_{τ} (f_{θ}, g)$ leads to the following definition.

Definition 1.

Let ${(X, β_{X}, f_{θ})}_{θ \in Θ \subset R^{p}}$ be a statistical space. The MRPE based on the random sample $X_{1}, \dots, X_{n}$ for the unknown parameter θ is given, for $τ > 0$ , by

(5) ${\hat{θ}}_{τ} (X_{1}, \dots, X_{n}) = arg sup_{θ \in Θ} \sum_{i = 1}^{n} \frac{f_{θ} {(X_{i})}^{τ}}{C_{τ} (θ)},$

where

$C_{τ} (θ) = {(\int f_{θ} {(x)}^{τ + 1} d x)}^{\frac{τ}{τ + 1}} .$

Further, at $τ = 0,$ ${\hat{θ}}_{0} (X_{1}, \dots, X_{n})$ minimizes the KL divergence, and thus the MRPE coincides with the MLE for $τ = 0 .$ Based on the previous definition (5), differentiating, we obtain that the estimating equations of the MRPE are given by

(6) $\sum_{i = 1}^{n} Ψ_{τ} (x_{i}; θ) = 0_{p},$

with

(7) $\begin{matrix} Ψ_{τ} (x; θ) & = f_{θ} {(x)}^{τ} (u_{θ} (x) - c_{τ} (θ)), \\ u_{θ} (x) & = {(u_{θ_{1}} (x), \dots, u_{θ_{p}} (x))}^{T}, u_{θ_{i}} (x) = \frac{\partial}{\partial θ_{i}} log f_{θ} (x), \\ \frac{\partial C_{τ} (θ)}{\partial θ} & = C_{τ} (θ) c_{τ} (θ) τ, \end{matrix}$

where

(8) $\begin{matrix} c_{τ} (θ) & = \frac{1}{κ_{τ} (θ)} ξ_{τ} (θ) = {(c_{τ, 1} (θ), \dots, c_{τ, p} (θ))}^{T}, \end{matrix}$

(9) $\begin{matrix} ξ_{τ} (θ) & = \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x, \end{matrix}$

(10) $\begin{matrix} κ_{τ} (θ) & = \int f_{θ} {(x)}^{τ + 1} d x . \end{matrix}$

The MRPE is an M-estimator and thus its asymptotic distribution and influence function (IF) can be obtained based on the asymptotic theory of the M-estimators. Broniatowski et al. [8] studied the asymptotic properties and robustness of the MRPEs. The next result recalls the asymptotic distribution of the MRPEs.

Theorem 1.

Let $θ_{0}$ be the true unknown value of $θ .$ Then,

(11) $\sqrt{n} ({\hat{θ}}_{τ} - θ_{0}) \underset{n \to \infty}{\overset{L}{\to}} N (0_{p}, V_{τ} (θ_{0}))$

where

(12) $V_{τ} (θ) = S_{τ} {(θ)}^{- 1} K_{τ} (θ) S_{τ} {(θ)}^{- 1}$

with

(13) $\begin{matrix} S_{τ} (θ) & = - E [\frac{\partial Ψ_{τ} {(X; θ)}^{T}}{\partial θ}], \end{matrix}$

(14) $\begin{matrix} K_{τ} (θ) & = E [Ψ_{τ} (X; θ) Ψ_{τ}^{T} (X; θ)] . \end{matrix}$

Castilla et al. [15] introduced useful notation for the computation of $V_{τ} (θ) .$

(15) $\begin{matrix} S_{τ} (θ) & = J_{τ} (θ) - \frac{1}{κ_{τ} (θ)} ξ_{τ} (θ) ξ_{τ} {(θ)}^{T}, \end{matrix}$

(16) $\begin{matrix} K_{τ} (θ) & = J_{2 τ} (θ) + \frac{1}{κ_{τ} (θ)} (\frac{κ_{2 τ} (θ)}{κ_{τ} (θ)} ξ_{τ} (θ) ξ_{τ} {(θ)}^{T} - ξ_{τ} (θ) ξ_{2 τ} {(θ)}^{T} - ξ_{2 τ} (θ) ξ_{τ} {(θ)}^{T}), \end{matrix}$

where

(17) $J_{τ} (θ) = \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x,$

and $ξ_{τ} (θ)$ and $κ_{τ} (θ)$ are as in (9) and (10), respectively.

Toma and Leoni-Aubin [16] defined new robust and efficient measures based on the RP. Later, Toma et al. [17] considered the MRPE for general parametric models and developed a model selection criterion for regression models. Broniatowski et al. [8] applied the method to the multiple regression model (MRM) with random covariates. Subsequently, Castilla et al. [18] developed Wald-type tests based on the MRPE for the MRM, and Castilla et al. [19] studied the MRPE for the MRM in the ultra-high dimensional set-up. Further, Jaenada and Pardo [20,21] considered the MRPE and Wald-type test statistics for generalized linear models (GLM). Besides Wald-type test statistics, other relevant test statistics play an important role in the statistical literature: the likelihood-ratio and Rao (or score) tests, which are based on restricted estimators, usually the RMLE. It therefore makes sense to develop robust versions of these popular statistics based on the RMRPE.

3. The Restricted Minimum Rényi Pseudodistance Estimator: Asymptotic Distribution and Influence Function of RMRPE

In this section, we introduce the RMRPE and we derive its asymptotic distribution. Moreover, we study its robustness properties through its influence function (IF).

Definition 2.

The RMRPE functional ${\tilde{T}}_{τ} (G)$ evaluated at the distribution G is defined by

$R_{τ} (g, f_{{\tilde{T}}_{τ} (G)}) = min_{θ \in Θ_{0}} R_{τ} (g, f_{θ}),$

given that such a minimum exists.

Accordingly, given a random sample $X_{1}, \dots, X_{n}$ from the distribution G, the RMRPE of θ is defined as

${\tilde{θ}}_{τ} = arg sup_{θ \in Θ_{0}} \sum_{i = 1}^{n} \frac{f_{θ} {(X_{i})}^{τ}}{C_{τ} (θ)} .$

The next result states the asymptotic distribution of the RMRPE, ${\tilde{θ}}_{τ} = {\tilde{T}}_{τ} (G_{n}) .$

Theorem 2.

Suppose that the true distribution satisfies the conditions of the model and let us denote by $θ_{0} \in Θ_{0}$ the true parameter. Then, the RMRPE ${\tilde{θ}}_{τ}$ of θ obtained under the constraints $g (θ) = 0_{r}$ has distribution

$n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) \underset{n ⟶ \infty}{\overset{L}{⟶}} N (0_{p}, Σ_{τ} (θ_{0}))$

where

$Σ_{τ} (θ_{0}) = P_{τ}^{*} (θ_{0}) K_{τ} (θ_{0}) P_{τ}^{*} {(θ_{0})}^{T},$

(18) $P_{τ}^{*} (θ_{0}) = S_{τ} {(θ_{0})}^{- 1} - Q_{τ} (θ_{0}) G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1},$

(19) $Q_{τ} (θ_{0}) = S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} .$

and $S_{τ} (θ_{0})$ is defined in (13), evaluated at $θ = θ_{0}$ .

Proof.

See Appendix A. □

To analyze the robustness of an estimator, Hampel et al. [22] introduced the concept of the influence function (IF). Since then, the IF has been widely used in the statistical literature to measure robustness in different statistical contexts. Intuitively, the IF describes the effect of an infinitesimal contamination of the model on the estimate. Hence, IFs associated with locally robust (B-robust) estimators should be bounded. Let us now obtain the IF of the RMRPE and analyze its boundedness to assess the robustness of the proposed estimators. We consider the contaminated model $g_{ε} (x) = (1 - ε) f_{θ} (x) + ε Δ_{x},$ with $Δ_{x}$ the degenerate distribution at the point $x,$ and we denote ${\tilde{θ}}_{τ, ε} = {\tilde{T}}_{τ} (G_{ε}),$ where $G_{ε}$ is the distribution function associated with $g_{ε} .$ By definition, ${\tilde{θ}}_{τ, ε}$ is the minimizer of $R_{τ} (g_{ε}, f_{θ})$ subject to $g ({\tilde{θ}}_{τ, ε}) = 0 .$ Following the same steps as in Theorem 5 in Broniatowski et al. [8], it can be seen that the influence function of ${\tilde{T}}_{τ}$ at $f_{θ}$ is given by

(20) $I F (x, {\tilde{T}}_{τ}, θ) = M_{τ} {(θ)}^{- 1} [f_{θ} {(x)}^{τ} u_{θ} (x) - c_{τ} (θ) f_{θ} {(x)}^{τ}],$

where $c_{τ} (θ)$ was defined in (8) and

$\begin{matrix} M_{τ} (θ) = & \frac{1}{\int f_{θ} {(x)}^{τ + 1} d x} [\int f_{θ} {(x)}^{τ + 1} d x \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x \\ & - (\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x) {(\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x)}^{T}], \end{matrix}$

with the additional condition that $g ({\tilde{θ}}_{τ, ε}) = 0 .$ Note that expression (20) corresponds to the IF of the unrestricted MRPE. Differentiating the constraint $g ({\tilde{θ}}_{τ, ε}) = 0$ with respect to $ε$ gives, at $ε = 0,$

(21) $G {(θ)}^{T} I F (x, {\tilde{T}}_{τ}, θ) = 0 .$

Based on (20) and (21) we have

$(\begin{matrix} M_{τ} (θ) \\ G {(θ)}^{T} \end{matrix}) I F (x, {\tilde{T}}_{τ}, θ) = (\begin{matrix} [f_{θ} {(x)}^{τ} u_{θ} (x) - c_{τ} (θ) f_{θ} {(x)}^{τ}] \\ 0 \end{matrix}) .$

Therefore,

$(\begin{matrix} M_{τ} {(θ)}^{T} & G (θ) \end{matrix}) (\begin{matrix} M_{τ} (θ) \\ G {(θ)}^{T} \end{matrix}) I F (x, {\tilde{T}}_{τ}, θ) = M_{τ} {(θ)}^{T} [f_{θ} {(x)}^{τ} u_{θ} (x) - c_{τ} (θ) f_{θ} {(x)}^{τ}]$

and

(22) $I F (x, {\tilde{T}}_{τ}, θ) = {(M_{τ} {(θ)}^{T} M_{τ} (θ) + G (θ) G {(θ)}^{T})}^{- 1} M_{τ} {(θ)}^{T} [f_{θ} {(x)}^{τ} u_{θ} (x) - c_{τ} (θ) f_{θ} {(x)}^{τ}] .$

Note that the matrices $M_{τ} (θ)$ and $G (θ)$ involved in expression (22) do not depend on $x$; they are determined by the model and tuning parameters $θ$ and $τ$ alone. The boundedness in $x$ of the IF of the RMRPE depends, therefore, on the boundedness of the factor

$[f_{θ} {(x)}^{τ} u_{θ} (x) - c_{τ} (θ) f_{θ} {(x)}^{τ}] .$

Therefore, the boundedness of the IF of the RMRPE depends directly on the boundedness of the IF of the MRPE, stated in (20). The IF of the MRPE has been widely studied for general statistical models, with the conclusion that the MRPEs are robust for positive values of $τ,$ and that such robustness increases with the tuning parameter. A full discussion can be found in the work of Broniatowski et al. [8]. Hence, the same properties hold for RMRPEs.
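To make the boundedness argument concrete, the sketch below evaluates the factor $f_{θ} {(x)}^{τ} (u_{θ} (x) - c_{τ} (θ))$ for the normal model with $θ = {(μ, σ)}^{T}$. The closed-form components used in the code come from a direct calculation under the $N (μ, σ^{2})$ model and should be read as an assumption of this example; the function name `tau_score_normal` is hypothetical.

```python
import math


def tau_score_normal(x, mu, sigma, tau):
    """Return f^tau * (u - c_tau) for N(mu, sigma^2) as (location, scale) components.

    Assumed closed forms for the normal model:
      location: f^tau * (x - mu) / sigma^2
      scale:    f^tau * (z^2 - 1 / (1 + tau)) / sigma,  with z = (x - mu) / sigma
    """
    z = (x - mu) / sigma
    f_tau = math.exp(-0.5 * tau * z * z) / (sigma * math.sqrt(2.0 * math.pi)) ** tau
    location = f_tau * z / sigma
    scale = f_tau * (z * z - 1.0 / (1.0 + tau)) / sigma
    return location, scale


if __name__ == "__main__":
    # Move a single observation further and further away from the model center.
    for x in (1.0, 5.0, 10.0, 100.0):
        mle_like = tau_score_normal(x, 0.0, 1.0, 0.0)   # tau = 0: unbounded in x
        robust = tau_score_normal(x, 0.0, 1.0, 0.5)     # tau > 0: redescends to 0
        print(x, mle_like, robust)
```

For $τ = 0$ both components grow without bound as $x$ moves away from $μ,$ whereas for $τ > 0$ the exponential weight drives them back to zero, which is the behavior the boundedness discussion relies on.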

4. Robust Test Statistics Based on RMRPEs

In this section, we develop two statistics based on the RMRPEs for testing composite null hypotheses, and their asymptotic distributions are obtained. Both procedures are particularized to testing the standard deviation (with unknown mean) in normal populations, and explicit expressions of the test statistics are obtained.

4.1. Testing Based on Divergence Measures

In this section, we present the family of Rényi’s pseudodistance test statistics (RPTS) for testing the null hypothesis given in (3). This family of test statistics is given by

(23) $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ}) = 2 n R_{γ} (f_{{\hat{θ}}_{τ}}, f_{{\tilde{θ}}_{τ}}) .$

The RPTS, $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ})$ , can be understood as a measure of the discrepancy between the model fitted with the best unrestricted estimator and the model fitted with the best estimator satisfying the null hypothesis. Large values of the RPTS indicate that the model densities associated with the restricted and unrestricted estimators are far from each other, and so the null hypothesis is not supported by the observed data. Hence, we should reject $H_{0}$ for large enough $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ})$ . We can observe that the family of RPTS defined in (23) depends on two tuning parameters, $τ$ and $γ$ . The first is used for estimating the unknown parameters, while the second is applied to obtain the family of test statistics. The following theorem presents the asymptotic distribution of the family of RPTS defined in (23).

Theorem 3.

The asymptotic distribution of $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ})$ defined in (23) coincides, under the null hypothesis $H_{0}$ given in (3), with the distribution of the random variable

$\sum_{i = 1}^{r} λ_{i}^{τ, γ} (θ_{0}) Z_{i}^{2},$

where $Z_{1}, \dots, Z_{r}$ are independent standard normal variables, $λ_{1}^{τ, γ} (θ_{0}), \dots, λ_{r}^{τ, γ} (θ_{0})$ are the nonzero eigenvalues of $M_{γ, τ} (θ_{0}) = A_{γ} (θ_{0}) B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0})$ and $k = r .$ The matrices $A_{γ} (θ_{0})$ and $B_{τ} (θ_{0})$ are given by,

(24) $\begin{matrix} A_{γ} (θ_{0}) & = & \frac{S_{γ} (θ_{0})}{κ_{τ} (θ_{0})}, \end{matrix}$

(25) $\begin{matrix} B_{τ} (θ_{0}) & = & Q_{τ} (θ_{0}) G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} . \end{matrix}$

Proof.

See Appendix A. □

Rényi’s Pseudodistance Test Statistics for Normal Populations

Under the $N (μ, σ^{2})$ model, consider the problem of testing

(26) $H_{0} : σ = σ_{0} versus H_{1} : σ \neq σ_{0}$

where $μ$ is an unknown nuisance parameter. In this case, the unrestricted and null parameter spaces are given by $Θ = \{(μ, σ) \in R^{2} | μ \in R, σ \in R^{+}\}$ and $Θ_{0} = \{(μ, σ) \in R^{2} | σ = σ_{0}, μ \in R\}$, respectively. If we consider the function $g (θ) = σ - σ_{0},$ with $θ = {(μ, σ)}^{T}$, the null hypothesis $H_{0}$ can be written as

$H_{0} : g (θ) = 0$

and we are in the situation considered in (3). We can observe that in our case $G (θ) = {(0, 1)}^{T} .$ Based on (6) and taking into account the fact that $f_{θ} (x)$ is the normal density with mean $μ$ and variance $σ^{2}$, the MRPE ${\hat{θ}}_{τ} = {({\hat{μ}}_{τ}, {\hat{σ}}_{τ})}^{T}$ of $θ = {(μ, σ)}^{T}$ is the solution of the system of nonlinear equations

$\{\begin{matrix} \sum_{i = 1}^{n} (X_{i} - μ) exp \{- \frac{τ}{2} {(\frac{X_{i} - μ}{σ})}^{2}\} = 0 \\ \sum_{i = 1}^{n} \{{(\frac{X_{i} - μ}{σ})}^{2} - \frac{1}{1 + τ}\} exp \{- \frac{τ}{2} {(\frac{X_{i} - μ}{σ})}^{2}\} = 0 \end{matrix}$

while the RMRPE ${\tilde{θ}}_{τ} = {({\tilde{μ}}_{τ}, σ_{0})}^{T},$ when $σ = σ_{0},$ is obtained by solving for $μ$ the location equation with the scale fixed at its null value,

$\sum_{i = 1}^{n} (X_{i} - μ) exp \{- \frac{τ}{2} {(\frac{X_{i} - μ}{σ_{0}})}^{2}\} = 0 .$
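The estimating equations for the normal model can be solved numerically in several ways. The sketch below uses a simple fixed-point (iteratively reweighted) scheme with weights $w_{i} = exp \{- \frac{τ}{2} {(\frac{X_{i} - μ}{σ})}^{2}\}$: the location equation gives $μ = \sum w_{i} X_{i} / \sum w_{i}$ and the scale equation gives $σ^{2} = (1 + τ) \sum w_{i} {(X_{i} - μ)}^{2} / \sum w_{i} .$ This solver, the median/MAD starting values, and the function names are assumptions of this example, not the procedure used in the paper. The restricted estimate is computed under the assumption that, with $σ$ fixed at $σ_{0},$ ${\tilde{μ}}_{τ}$ solves the location equation.

```python
import math
import statistics


def _weights(xs, mu, sigma, tau):
    """Exponential downweighting exp(-tau/2 * z^2) of each observation."""
    return [math.exp(-0.5 * tau * ((x - mu) / sigma) ** 2) for x in xs]


def mrpe_normal(xs, tau, tol=1e-12, max_iter=1000):
    """Unrestricted MRPE (mu, sigma) for the normal model by fixed-point iteration."""
    mu = statistics.median(xs)
    # Robust starting scale (assumption): rescaled MAD, falling back to the ML sd.
    sigma = 1.4826 * statistics.median([abs(x - mu) for x in xs]) or statistics.pstdev(xs)
    for _ in range(max_iter):
        w = _weights(xs, mu, sigma, tau)
        sw = sum(w)
        new_mu = sum(wi * x for wi, x in zip(w, xs)) / sw
        new_var = (1.0 + tau) * sum(wi * (x - new_mu) ** 2 for wi, x in zip(w, xs)) / sw
        new_sigma = math.sqrt(new_var)
        done = abs(new_mu - mu) + abs(new_sigma - sigma) < tol
        mu, sigma = new_mu, new_sigma
        if done:
            break
    return mu, sigma


def rmrpe_normal_mean(xs, sigma0, tau, tol=1e-12, max_iter=1000):
    """Restricted estimate of mu when the scale is fixed at sigma0."""
    mu = statistics.median(xs)
    for _ in range(max_iter):
        w = _weights(xs, mu, sigma0, tau)
        new_mu = sum(wi * x for wi, x in zip(w, xs)) / sum(w)
        done = abs(new_mu - mu) < tol
        mu = new_mu
        if done:
            break
    return mu


if __name__ == "__main__":
    sample = [-1.3, -0.8, -0.4, -0.1, 0.0, 0.2, 0.5, 0.9, 1.4, 9.0]  # hypothetical data
    for tau in (0.0, 0.2, 0.5):
        print(tau, mrpe_normal(sample, tau), rmrpe_normal_mean(sample, 1.0, tau))
```

At $τ = 0$ all weights equal one, so the iteration returns the sample mean and the maximum likelihood standard deviation after a single step; for $τ > 0$ observations far from the bulk of the data receive weights close to zero.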

After some algebra (see Appendix A) we obtain that the RPTS for testing (26) under normal populations can be expressed as

(27) $\begin{matrix} T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ}) & = & 2 n R_{γ} (N ({\hat{μ}}_{τ}, {\hat{σ}}_{τ}^{2}), N ({\tilde{μ}}_{τ}, σ_{0}^{2})) \\ & = & \frac{2 n}{γ (γ + 1)} log [\frac{1}{{\hat{σ}}_{τ} σ_{0}^{γ}} {(\frac{\sqrt{{\hat{σ}}_{τ}^{2} + γ σ_{0}^{2}}}{\sqrt{γ + 1}})}^{γ + 1}] + n \frac{{({\hat{μ}}_{τ} - {\tilde{μ}}_{τ})}^{2}}{(γ σ_{0}^{2} + {\hat{σ}}_{τ}^{2})} . \end{matrix}$

Based on (27), and taking into account that the nonzero eigenvalue of the matrix $A_{γ} (θ) B_{τ} (θ) K_{τ} (θ) B_{τ} (θ)$ is given by (see Appendix A)

$l_{τ, γ} (σ) = \frac{1}{2} \frac{{(τ + 1)}^{3}}{{(γ + 1)}^{2} {(2 τ + 1)}^{\frac{5}{2}}} (3 τ^{2} + 4 τ + 2),$

we apply Theorem 3 such that

$l_{τ, γ} {(σ_{0})}^{- 1} (\frac{2 n}{γ (γ + 1)} log [\frac{1}{{\hat{σ}}_{τ} σ_{0}^{γ}} {(\frac{\sqrt{{\hat{σ}}_{τ}^{2} + γ σ_{0}^{2}}}{\sqrt{γ + 1}})}^{γ + 1}] + n \frac{{({\hat{μ}}_{τ} - {\tilde{μ}}_{τ})}^{2}}{(γ σ_{0}^{2} + {\hat{σ}}_{τ}^{2})}) \overset{L}{\underset{n \to \infty}{\to}} χ_{1}^{2} .$

Note that the RPTS is indexed by two tuning parameters, $γ$ and $τ$ , the first controlling the robustness of the pseudodistance and the second controlling the robustness of the estimation. For simplicity, we use $γ = τ$ for the normal population application.
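The closed-form statistic (27), its eigenvalue correction, and the resulting asymptotic p-value can be sketched as follows. The function names are hypothetical, the estimates passed in the usage lines are made-up numbers, and the $γ = 0$ branch uses the Kullback–Leibler expression for two normal densities; the chi-square tail with one degree of freedom is computed through the identity $P (χ_{1}^{2} > x) = erfc (\sqrt{x / 2}) .$

```python
import math


def rp_normal(mu1, sigma1, mu2, sigma2, gamma):
    """R_gamma(N(mu1, sigma1^2), N(mu2, sigma2^2)); gamma = 0 gives the KL limit."""
    if gamma == 0.0:
        return ((sigma2 ** 2 - sigma1 ** 2) / (2.0 * sigma1 ** 2)
                + math.log(sigma1 / sigma2)
                + 0.5 * (mu1 - mu2) ** 2 / sigma1 ** 2)
    s = sigma1 ** 2 + gamma * sigma2 ** 2
    # log of (1 / (sigma1 * sigma2^gamma)) * (sqrt(s) / sqrt(gamma + 1))^(gamma + 1)
    log_term = (-math.log(sigma1) - gamma * math.log(sigma2)
                + 0.5 * (gamma + 1.0) * (math.log(s) - math.log(gamma + 1.0)))
    return log_term / (gamma * (gamma + 1.0)) + 0.5 * (mu1 - mu2) ** 2 / s


def rpts_normal(n, mu_hat, sigma_hat, mu_tilde, sigma0, gamma):
    """Statistic (27): 2n R_gamma between the unrestricted and restricted fits."""
    return 2.0 * n * rp_normal(mu_hat, sigma_hat, mu_tilde, sigma0, gamma)


def eigenvalue(tau, gamma):
    """Scaling constant l_{tau, gamma} of the limiting distribution."""
    return (0.5 * (tau + 1.0) ** 3 * (3.0 * tau ** 2 + 4.0 * tau + 2.0)
            / ((gamma + 1.0) ** 2 * (2.0 * tau + 1.0) ** 2.5))


def chi2_1_sf(x):
    """Upper tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(max(x, 0.0) / 2.0))


def rpts_p_value(n, mu_hat, sigma_hat, mu_tilde, sigma0, tau, gamma):
    """Scaled statistic and its asymptotic p-value under H0."""
    scaled = rpts_normal(n, mu_hat, sigma_hat, mu_tilde, sigma0, gamma) / eigenvalue(tau, gamma)
    return scaled, chi2_1_sf(scaled)


if __name__ == "__main__":
    # Hypothetical estimates: n = 50, unrestricted fit (0.1, 1.3), restricted fit (0.08, 1.0).
    print(rpts_p_value(50, 0.1, 1.3, 0.08, 1.0, tau=0.3, gamma=0.3))
```

The log term is evaluated in expanded form rather than as the logarithm of a product, which keeps the computation stable when $γ$ is small.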

Remark 1.

For $τ = γ = 0$ , the RPTS coincides with the asymptotic likelihood ratio test for testing (26). Indeed, for $τ = 0,$ we have that the MLE and RMLE are given, respectively, by

$\hat{θ} = (\bar{X}, {\hat{σ}}_{n}^{2}),$ with ${\hat{σ}}_{n}^{2} = \frac{1}{n} \sum_{i = 1}^{n} {(X_{i} - \bar{X})}^{2},$ and $\tilde{θ} = (\bar{X}, σ_{0}^{2}) .$

Now, the expression of the Kullback–Leibler divergence (the RP for $γ = 0$ ) between two normal densities, $N (μ_{1}, σ_{1})$ and $N (μ_{2}, σ_{2}),$ is given by

(28) $lim_{γ \to 0} R_{γ} (N (μ_{1}, σ_{1}), N (μ_{2}, σ_{2})) = \frac{σ_{2}^{2} - σ_{1}^{2}}{2 σ_{1}^{2}} + ln \frac{σ_{1}}{σ_{2}} + \frac{1}{2} \frac{{(μ_{1} - μ_{2})}^{2}}{σ_{1}^{2}} .$

and thus the RPTS for $γ = τ = 0$ is

$T_{0} (\hat{θ}, \tilde{θ}) = n \frac{σ_{0}^{2}}{{\hat{σ}}_{n}^{2}} - n + 2 n ln \frac{{\hat{σ}}_{n}}{σ_{0}} .$

On the other hand, the likelihood ratio for testing (26) is given by

$λ (X_{1}, \dots, X_{n}) = {(\frac{{\hat{σ}}_{n}}{σ_{0}})}^{n / 2} e^{- n \frac{{\hat{σ}}_{n}^{2}}{2 σ_{0}^{2}}} e^{n / 2},$

and so, both expressions are related through

$- 2 ln λ (X_{1}, \dots, X_{n}) = T_{0} (\hat{θ}, \tilde{θ}) .$

4.2. Rao’s-Type Tests Based on RMRPE

Rao test statistics are among the most popular score test statistics for testing simple and composite null hypotheses in general statistical models. For a simple null hypothesis, the test requires no parameter estimation, but for composite ones, the classical Rao test is based on the likelihood score function evaluated at the restricted MLE (see Rao [23]). Basu et al. [24] generalized Rao’s procedure by using score functions associated with RMDPDEs, which brings a considerable gain in robustness to the resulting Rao-type test. In this section, we develop Rao-type test statistics based on the score function associated with RMRPEs.

Let us consider the $τ$ -score function associated to the RMRPE,

$ψ_{τ} (x; θ) = f_{θ} {(x)}^{τ} (u_{θ} (x) - c_{τ} (θ)),$

so the estimating equations for the MRPE are given by

$\sum_{i = 1}^{n} ψ_{τ} (x_{i}; θ) = 0_{p} .$

Then, the $τ$ -score statistic can be defined as

$Ψ_{τ} (θ) = \sum_{i = 1}^{n} ψ_{τ} (x_{i}; θ) = {(\sum_{i = 1}^{n} ψ_{τ}^{1} (x_{i}; θ), \dots, \sum_{i = 1}^{n} ψ_{τ}^{p} (x_{i}; θ))}^{T} .$

Taking expectations in the corresponding quantities, it is not difficult to show that

$\begin{matrix} E [{(\frac{τ}{C_{τ} (θ)} f_{θ} {(X)}^{τ} (u_{θ} (X) - c_{τ} (θ)))}_{θ = θ_{0}}] & = 0_{p} \\ E [{(f_{θ} {(X)}^{2 τ} (u_{θ} (X) - c_{τ} (θ)) {(u_{θ} (X) - c_{τ} (θ))}^{T})}_{θ = θ_{0}}] & = K_{τ} (θ_{0}), \end{matrix}$

where $K_{τ} (θ)$ is defined in (16), and so, by the central limit theorem, the $τ$-score statistic is asymptotically normal,

(29) $\frac{1}{\sqrt{n}} Ψ_{τ} (θ) \underset{n \to \infty}{\overset{L}{\to}} N (0_{p}, K_{τ} (θ)) .$

The previous convergence motivates the definition of the Rao-type test statistics.

4.2.1. Rao-Type Test Statistics for Testing Simple Null Hypothesis

We first consider the simple null hypothesis test

(30) $H_{0} : θ = θ_{0} vs . H_{1} : θ \neq θ_{0} .$

Then, the Rao-type test statistic $R_{τ} (θ_{0})$ for testing (30) is defined as

$R_{τ} (θ_{0}) = \frac{1}{n} Ψ_{τ} {(θ_{0})}^{T} K_{τ} {(θ_{0})}^{- 1} Ψ_{τ} (θ_{0}) .$

Note that this test statistic depends on $τ$ through the score vector $Ψ_{τ} (θ_{0})$ and the matrix $K_{τ} (θ_{0})$ involved in the definition, and again, the robustness of the statistic increases with $τ .$ Moreover, the matrix $K_{τ} (θ_{0})$ may have an explicit expression for certain statistical models, but otherwise it would have to be estimated from the sample.

Further, from (29), we have that, under the null hypothesis,

$R_{τ} (θ_{0}) \underset{n \to \infty}{\overset{L}{\to}} χ_{p}^{2}$

with p being the dimension of the parameter space. Then, the null hypothesis is rejected if $R_{τ} (θ_{0}) > χ_{p, α}^{2}$, where $χ_{p, α}^{2}$ denotes the upper $α$-quantile of a chi-square distribution with p degrees of freedom.

4.2.2. Rao-Type Test Statistics for Testing Composite Null Hypothesis

Next, let us consider a composite null hypothesis of the form

(31) $H_{0} : g (θ) = 0_{r} vs . H_{1} : g (θ) \neq 0_{r},$

where the function $g : R^{p} \to R^{r}$ is a differentiable vector-valued function. Then, any vector $θ$ satisfying the null hypothesis belongs to the restricted parameter space given in (3). The generalized Rao-type test statistic associated with the RMRPE ${\tilde{θ}}_{τ}$ with tuning parameter $τ,$ for testing (31), is given by

(32) $R_{τ} ({\tilde{θ}}_{τ}) = \frac{1}{n} Ψ_{τ} {({\tilde{θ}}_{τ})}^{T} Q_{τ} ({\tilde{θ}}_{τ}) {[Q_{τ} {({\tilde{θ}}_{τ})}^{T} K_{τ} ({\tilde{θ}}_{τ}) Q_{τ} ({\tilde{θ}}_{τ})]}^{- 1} Q_{τ} {({\tilde{θ}}_{τ})}^{T} Ψ_{τ} ({\tilde{θ}}_{τ}) .$

Using similar arguments to Basu et al. [24], it is possible to show that, under general regularity conditions, the Rao-type test statistics $R_{τ} ({\tilde{θ}}_{τ})$ have an asymptotic chi-square distribution with r degrees of freedom under the null hypothesis given in (31). Therefore, the rejection region of the test is given by

$\{(X_{1}, \dots, X_{n}) : R_{τ} ({\tilde{θ}}_{τ}) > χ_{r, α}^{2}\} .$

Again, the tuning parameter $τ$ controls the trade-off between efficiency and robustness of the test. Indeed, for $τ = 0$ , the generalized Rao type test statistic $R_{τ = 0} ({\tilde{θ}}_{0})$ coincides with the classical Rao test for composite null hypothesis.

4.2.3. Rao Test for Normal Populations

Consider the test defined in (26) for testing the standard deviation of a normal population with unknown mean. The explicit expressions of the main quantities involved in the definition (32), for this testing problem and the assumed parametric model, are given by

$\begin{matrix} ψ_{τ} (X; (μ, σ)) & = {(\frac{X - μ}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}}, ({(\frac{X - μ}{σ})}^{2} - \frac{1}{1 + τ}) \frac{1}{σ} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}})}^{T}, \\ K_{τ} ((μ, σ)) & = \frac{1}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{3 / 2}} (\begin{matrix} 1 & 0 \\ 0 & \frac{3 τ^{2} + 2 + 4 τ}{{(1 + τ)}^{2} (1 + 2 τ)} \end{matrix}), \\ Q_{τ} ((μ, σ)) & = (\begin{matrix} 0 \\ 1 \end{matrix}) . \end{matrix}$

The step-by-step calculation of these quantities is detailed in Appendix A. Then, the Rao-type test statistic for the composite null hypothesis (26) is given by

$R_{τ} (\tilde{μ}) = \frac{1}{n} \frac{{(1 + 2 τ)}^{3 / 2} {(1 + τ)}^{2} (1 + 2 τ)}{3 τ^{2} + 4 τ + 2} {[\sum_{i = 1}^{n} ({(\frac{x_{i} - \tilde{μ}}{σ_{0}})}^{2} - \frac{1}{τ + 1}) e^{- \frac{τ}{2} {(\frac{x_{i} - \tilde{μ}}{σ_{0}})}^{2}}]}^{2}$

where $({\tilde{μ}}_{τ}, σ_{0})$ denotes the RMRPE with tuning parameter $τ .$ Note that, for $τ = 0,$ ${\tilde{μ}}_{τ = 0} = \bar{X} .$ Then, the Rao-type test statistic based on the RMRPE with $τ = 0$ (the restricted MLE) coincides with the classical Rao test.
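A minimal sketch of the Rao-type statistic for the normal scale problem is given below. It assumes that the restricted estimate ${\tilde{μ}}_{τ}$ solves the weighted location equation with $σ$ fixed at $σ_{0}$ (computed here by a simple reweighting iteration, which is an implementation choice of this example), and it uses the asymptotic $χ_{1}^{2}$ reference distribution through $P (χ_{1}^{2} > x) = erfc (\sqrt{x / 2}) .$ Function names and the sample are hypothetical.

```python
import math
import statistics


def restricted_mean(xs, sigma0, tau, tol=1e-12, max_iter=1000):
    """Restricted location estimate with the scale fixed at sigma0 (reweighting iteration)."""
    mu = statistics.median(xs)
    for _ in range(max_iter):
        w = [math.exp(-0.5 * tau * ((x - mu) / sigma0) ** 2) for x in xs]
        new_mu = sum(wi * x for wi, x in zip(w, xs)) / sum(w)
        done = abs(new_mu - mu) < tol
        mu = new_mu
        if done:
            break
    return mu


def rao_type_test(xs, sigma0, tau):
    """Rao-type statistic for H0: sigma = sigma0 and its asymptotic chi-square(1) p-value."""
    n = len(xs)
    mu = restricted_mean(xs, sigma0, tau)
    score = 0.0
    for x in xs:
        z2 = ((x - mu) / sigma0) ** 2
        score += (z2 - 1.0 / (1.0 + tau)) * math.exp(-0.5 * tau * z2)
    # Constant (1 + 2 tau)^{3/2} (1 + tau)^2 (1 + 2 tau) / (3 tau^2 + 4 tau + 2).
    const = ((1.0 + 2.0 * tau) ** 2.5 * (1.0 + tau) ** 2
             / (3.0 * tau ** 2 + 4.0 * tau + 2.0))
    stat = const * score ** 2 / n
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


if __name__ == "__main__":
    sample = [-1.3, -0.8, -0.4, -0.1, 0.0, 0.2, 0.5, 0.9, 1.4, 9.0]  # one gross outlier
    for tau in (0.0, 0.2, 0.5):
        print(tau, rao_type_test(sample, 1.0, tau))
```

With $τ = 0$ the statistic reduces to $\frac{1}{2 n} {[\sum_{i = 1}^{n} ({(\frac{x_{i} - \bar{X}}{σ_{0}})}^{2} - 1)]}^{2},$ which a single outlier can inflate arbitrarily; positive values of $τ$ downweight such observations.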

5. Simulation Study: Application to Normal Populations

In this section, we empirically analyze the performance of the proposed estimators under the normal parametric model and RPTS and Rao-type test statistics for the problem of testing (26) in terms of efficiency and robustness. We examine the accuracy of the RMRPEs, and we further examine the robustness properties of both families of estimators under different contamination scenarios. Further, we investigate the empirical level and power of the proposed test statistics under different sample sizes and contamination scenarios.

Let us consider a univariate normal model with true parameter value $θ_{0} = (μ = 0,$ $σ = 1),$ and the problem of testing

(33) $H_{0} : σ = 1 vs . H_{1} : σ \neq 1 .$

The restricted parameter space is then given by

$Θ_{0} = {(μ, 1) : μ \in R} .$

In order to evaluate the robustness properties of the estimators and test statistics, we introduce contamination in the data by replacing $ε %$ of the observations by a contaminated sample, where $ε$ denotes the contamination level. We consider the following contamination scenarios:

Pure data.
Scenario 1: Slightly contaminated data. We replace $ε %$ of the observations by a contaminated sample from a normal distribution, $N (0, \sqrt{3}) .$
Scenario 2: Heavily contaminated data. We replace $ε %$ of the observations by a contaminated sample from a normal distribution, $N (0, \sqrt{5}) .$

Further, in order to evaluate the power of the tests, we consider an alternative true parameter value $θ_{1} = (0, 0.7),$ which does not satisfy the null hypothesis (33) (or, equivalently, the restrictions of the parameter space). In this scenario, the parameters of the contaminating distribution are set to $(0, 1.2)$ for slight contamination and $(0, 1.5)$ for heavy contamination.
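The contamination mechanism described above can be sketched as follows. The example assumes that the second argument of $N (0, \sqrt{3})$ is a standard deviation, that the replaced positions are chosen at random, and that the number of replaced observations is the rounded value of $ε n$; none of these details is fixed by the text, and the function name `contaminated_sample` is hypothetical.

```python
import math
import random


def contaminated_sample(n, eps, sigma_true=1.0, sigma_contam=math.sqrt(3.0), seed=None):
    """Draw n observations from N(0, sigma_true^2) and replace a fraction eps of them.

    Replacements are drawn from N(0, sigma_contam^2); sigma_contam is read as a
    standard deviation (assumption). Returns the sample and the number replaced.
    """
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, sigma_true) for _ in range(n)]
    k = round(eps * n)
    for idx in rng.sample(range(n), k):
        xs[idx] = rng.gauss(0.0, sigma_contam)
    return xs, k


if __name__ == "__main__":
    pure, _ = contaminated_sample(100, 0.0, seed=1)
    slight, k1 = contaminated_sample(100, 0.10, sigma_contam=math.sqrt(3.0), seed=1)
    heavy, k2 = contaminated_sample(100, 0.10, sigma_contam=math.sqrt(5.0), seed=1)
    print(k1, k2, len(pure), len(slight), len(heavy))
```

Repeating this draw over many replications and recording the proportion of rejections of a given test gives its empirical level under the null value $σ = 1,$ or its empirical power when `sigma_true` is set to the alternative value.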

Figure 1 shows the root mean square error (RMSE) of the RMRPE of the scale parameter $σ$ for different values of the tuning parameter, $τ = 0, 0.2, 0.4, 0.6, 0.8,$ over $R = 10{,}000$ replications. As expected, large values of the tuning parameter produce more robust estimators, which is particularly advantageous in the heavily contaminated scenario. Furthermore, even when very low levels of contamination are introduced in the data, $ε = 5 %,$ the RMRPE with a moderate value of the tuning parameter outperforms the classical MLE, without a significant loss of efficiency in the absence of contamination.

Figure 2 presents the empirical level and power of both RPTS and Rao-type test statistics based on RMRPEs for different values of the tuning parameter, $τ = 0, 0.2, 0.4, 0.6, 0.8,$ under increasing contamination levels. The empirical level and power are computed as the proportion of rejections over $R = 10{,}000$ replications. The empirical level produced by the classical likelihood ratio and Rao tests rapidly increases and separates from the levels obtained with any robust test. Regarding the empirical power, all robust tests with moderate and large values of the tuning parameter outperform the classical test within their family under contaminated scenarios, but Rao-type test statistics based on RMRPEs are more conservative than RPTSs, thus exhibiting lower levels and powers. Hence, the proposed test statistics provide an appealing alternative to the classical likelihood ratio and Rao tests, with a small loss of efficiency in exchange for a clear gain in robustness.

The sample size could also play a crucial role in the performance of the tests, even more so when the data are contaminated. Figure 3 shows the effect of the sample size on the performance of the tests in terms of empirical level, under a $10 %$ contamination level in the data. As discussed, Rao-type test statistics based on RMRPEs are more conservative, and so tests based on RMRPEs with positive values of the tuning parameter produce lower empirical levels. Here, the poor performance of the classical Rao test relative to all the other tests stands out. Moreover, as the sample size increases, the performance gap between non-robust and robust methods widens.

Following the discussions in the preceding sections, larger values of the tuning parameter produce more robust but less efficient estimators. Therefore, the optimal value of $τ$ should achieve the best trade-off between efficiency and robustness. Warwick and Jones [25] first introduced a useful data-based procedure for the choice of the tuning parameter for the MDPDE based on minimizing the asymptotic MSE of the estimator. However, this method depends on the choice of a pilot estimator, and Basak et al. [26] improved the method by removing the dependency on an initial estimator. The proposed algorithm was developed ad hoc for the MDPDE, but it can be easily adapted to the MRPE and RMRPE by simply substituting the expression of the variance of the MDPDE by the variance of the MRPE or the RMRPE, respectively.

6. Real Data Application

Finally, we illustrate the performance of the proposed test statistics in two real data applications, where the gathered information contains some outlying observations. Both real datasets are modeled under the normal model, and hypothesis tests on the standard deviation of the population are performed.

6.1. Telephone-Fault Data

We consider the data on telephone line faults presented and analyzed by Welch [27] and Simpson [4]. The dataset consists of $n = 14$ ordered differences between the inverse test rates and the inverse control rates in matched pairs of areas,

$- 988, - 135, - 78, 3, 59, 83, 93, 110, 189, 197, 204, 229, 289, 310 .$

Basu et al. [24,28] modeled these differences as a normal random variable and pointed out that the first observation is a clear outlier, as its value is distant from the rest of the data. They tested simple and composite null hypotheses for the mean under the normal model, as well as a simple null hypothesis assuming a known mean. Here, we propose to test for the standard deviation of the normal distribution. Note that, computing the MLE of the sample with full and clean data (after removing the outlying observation), we obtain

$(\hat{μ}, \hat{σ}) = (40.36, 323.08),$

and

$(\hat{μ}, \hat{σ}) = (119.46, 134.82),$

respectively. Accordingly, the outlier clearly influences the model parameter estimates, playing a crucial role in the rejection of any null hypothesis. We consider the composite null hypothesis

(34) $H_{0} : σ = 135 \quad \text{vs.} \quad H_{1} : σ \neq 135,$

where the value $σ = 135$ has been chosen according to the estimate obtained with the clean data.
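The location and scale estimates reported above for the telephone-fault data can be reproduced with a short Python sketch. The helper name `location_scale` is hypothetical and introduced only for this illustration. The sketch computes the standard deviation with both divisors; as far as this check shows, the reported scale values (323.08 and 134.82) agree with the divisor $n - 1$, while the divisor $n$ of the normal-model MLE gives roughly 311.33 and 129.53, which leaves the choice $σ = 135$ for the null value reasonable in either case.

```python
import statistics

# Ordered differences from the telephone-fault dataset quoted in the text.
telephone = [-988, -135, -78, 3, 59, 83, 93, 110, 189, 197, 204, 229, 289, 310]
telephone_clean = telephone[1:]  # drop the outlying first observation (-988)


def location_scale(sample):
    """Return (mean, sd with divisor n, sd with divisor n - 1)."""
    return (
        statistics.mean(sample),
        statistics.pstdev(sample),  # divisor n: normal-model MLE of sigma
        statistics.stdev(sample),   # divisor n - 1
    )


full_est = location_scale(telephone)
clean_est = location_scale(telephone_clean)
for label, est in (("full", full_est), ("clean", clean_est)):
    print(label, [round(value, 2) for value in est])
```

Removing the single outlier roughly triples the location estimate and more than halves the scale estimate, which is why the test decision is so sensitive to that observation.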

Figure 4 presents the RPTS (top) and Rao-type (bottom) test statistics (left) and p-values (right) for the telephone data against increasing values of the tuning parameter. Both classical tests clearly reject the null hypothesis when fitting the model with the original data, but the decision turns around sharply as the tuning parameter $τ$ goes beyond $0.2$ for the RPTS and $0.15$ for the Rao-type test statistics based on RMRPEs. On the other hand, all statistics agree on not rejecting the null hypothesis when fitting the model with the clean data. This example illustrates the applicability of the robust methods, which are barely affected by such an outlying observation, and the good performance of the proposed statistics, which remain stable under contaminated observations.

6.2. Darwin’s Plant Fertilization Data

Darwin [29] performed an experiment to determine whether self-fertilized plants and cross-fertilized plants have different growth rates. He sowed in pots pairs of Zea mays plants, one self-fertilized and the other cross-fertilized, and after a specific time period, the height of each plant was measured. A particular sample of $n = 15$ pairs of plants led to the following paired differences (cross-fertilized minus self-fertilized).

$- 67, - 48, 6, 8, 14, 16, 23, 24, 28, 29, 41, 49, 56, 60, 75$

A parametric approach to analyze the data as a random sample from a normal distribution with unknown mean and standard deviation was developed by Basu et al. [24]. Here, there is no extreme outlying observation, but the first two observations seem to be distant from the rest of the sample, influencing the model parameter estimates and test decisions. Indeed, the MLE computed with the original data is $(\hat{μ}, \hat{σ}) = (20.93, 37.74),$ while the MLE after removing the first two observations switches to $(\hat{μ}, \hat{σ}) = (33, 21.54) .$ Therefore, removing influential observations may alter the decision of a test. According to these results, we consider the testing problem

(35) $H_{0} : σ = 23 \quad \text{vs.} \quad H_{1} : σ \neq 23 .$

Figure 5 shows the test statistics (left) and corresponding p-values (right) for the two families of statistics considered, the RPTS (top) and Rao-type test statistics (bottom), against the tuning parameter value $τ$ . Again, test statistics based on RMRPEs with large enough tuning parameters do not reject the null hypothesis, unlike tests based on low values of $τ$ , including the restricted MLE ($τ = 0$). The disagreement disappears when using the clean data, as all tests agree on not rejecting the null hypothesis.
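The sensitivity of a non-robust test to the two most negative differences can be checked with a small Python sketch. It applies the classical likelihood ratio test for $σ$ under the normal model, with its asymptotic chi-square law on one degree of freedom, to the original and cleaned Darwin data. This is only a stand-in for the $τ = 0$ end of the families plotted in Figure 5, not an implementation of the RPTS or Rao-type statistics, and the function name `lrt_sigma` is hypothetical.

```python
import math

# Paired differences (cross-fertilized minus self-fertilized) quoted in the text.
darwin = [-67, -48, 6, 8, 14, 16, 23, 24, 28, 29, 41, 49, 56, 60, 75]
darwin_clean = darwin[2:]  # drop the first two (most negative) observations


def lrt_sigma(sample, sigma0):
    """Likelihood ratio statistic and asymptotic p-value for H0: sigma = sigma0."""
    n = len(sample)
    mean = sum(sample) / n
    var_mle = sum((x - mean) ** 2 for x in sample) / n  # divisor n
    ratio = var_mle / sigma0 ** 2
    stat = n * (ratio - 1.0 - math.log(ratio))
    # Chi-square(1) survival function written through the complementary error function.
    p_value = math.erfc(math.sqrt(max(stat, 0.0) / 2.0))
    return stat, p_value


stat_full, p_full = lrt_sigma(darwin, sigma0=23.0)
stat_clean, p_clean = lrt_sigma(darwin_clean, sigma0=23.0)
print(f"original data: stat = {stat_full:.2f}, p = {p_full:.4f}")
print(f"cleaned data:  stat = {stat_clean:.2f}, p = {p_clean:.4f}")
```

At the usual $5\%$ level this classical test rejects $H_{0}$ with the original data and does not reject it with the cleaned data, in line with the behavior described for low values of $τ$.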

7. Concluding Remarks

In this paper, we presented for the first time the family of RMRPEs. We derived their asymptotic distribution and proved some suitable properties, such as consistency under the parameter restriction and robustness against data contamination. Further, based on these RMRPEs, we generalized two important families of statistics, namely RPTS and Rao-type tests, for testing a composite null hypothesis. Moreover, we obtained explicit expressions of the RMRPEs, RPTS and Rao-type test statistics for testing the variance of a normal population with an unknown mean. It was empirically shown that the proposed RPTS and Rao-type test statistics are robust, unlike classical tests based on the MLE, under normal populations. Indeed, the robustness of the tests is controlled by a tuning parameter $τ$ , and so larger values of $τ$ produce more robust (although less efficient) estimators. Finally, some classical numerical examples illustrate the theoretical properties and applicability of the proposed methods.

Author Contributions

Conceptualization, M.J., P.M. and L.P.; methodology, M.J., P.M. and L.P.; software, M.J., P.M. and L.P.; validation, M.J., P.M. and L.P.; formal analysis, M.J., P.M. and L.P.; investigation, M.J., P.M. and L.P.; resources, M.J., P.M. and L.P.; data curation, M.J., P.M. and L.P.; writing—original draft preparation, M.J., P.M. and L.P.; writing—review and editing, M.J., P.M. and L.P.; visualization, M.J., P.M. and L.P.; supervision, M.J., P.M. and L.P.; project administration, M.J., P.M. and L.P.; funding acquisition, M.J., P.M. and L.P. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We are very grateful to the referees and associate editor for their helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

DPD	Density Power Divergence
IF	Influence Function
KL	Kullback–Leibler
LRM	Linear Regression Model
MLE	Maximum Likelihood Estimator
MDPDE	Minimum Density Power Divergence Estimator
MRPE	Minimum Rényi Pseudodistance Estimator
RMRPE	Restricted minimum Rényi Pseudodistance Estimator
RP	Rényi Pseudodistance
RPTS	Rényi Pseudodistance test statistic

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Figure 1. RMSE of the RMRPE under increasing contamination levels (slightly contaminated at left and heavily contaminated at right) for different values of the tuning parameter [Formula omitted. See PDF.] over [Formula omitted. See PDF.] replications. (a) Scenario 1, (b) Scenario 2.

Figure 2. Empirical level and power under increasing contamination (slightly contaminated at left and heavily contaminated at right) over $R = 10{,}000$ repetitions. (a) Scenario 1, (b) Scenario 2.

Figure 3. Empirical level under increasing sample sizes for a $10\%$ contamination level (slightly contaminated at left and heavily contaminated at right) over [Formula omitted. See PDF.] repetitions. (a) Scenario 1, (b) Scenario 2.

Figure 4. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-values (right), for testing (34) with original and cleaned (after outlier removal) telephone-fault data.

Figure 5. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-values (right), for testing (35) with original and cleaned (after outlier removal) Darwin data.

Appendix A

Appendix A.1. Proof of Theorem 2

We denote $h_{n} (θ) = \frac{1}{n} \sum_{i = 1}^{n} \frac{f_{θ} {(X_{i})}^{τ}}{C_{τ} (θ)} .$

Differentiating both sides of the equality, we have $\frac{\partial h_{n} (θ)}{\partial θ} = \frac{τ}{C_{τ} (θ)} \frac{1}{n} \sum_{i = 1}^{n} f_{θ} {(X_{i})}^{τ} (u_{θ} (X_{i}) - c_{τ} (θ)) .$

Now we establish that ${(\frac{\partial^{2} h_{n} (θ)}{\partial θ \partial θ^{T}})}_{θ = θ_{0}} \underset{n \to \infty}{\overset{P}{\to}} - \frac{τ}{C_{τ} (θ)} S_{τ} (θ_{0}) .$

We have $\begin{matrix} \frac{\partial^{2} h_{n} (θ)}{\partial θ \partial θ^{T}} & = & \frac{1}{C_{τ} {(θ)}^{2}} \{\frac{1}{n} \sum_{i = 1}^{n} [(τ^{2} f_{θ} {(X_{i})}^{τ} u_{θ} (X_{i}) u_{θ} {(X_{i})}^{T} + τ f_{θ} {(X_{i})}^{τ} \frac{\partial u_{θ} (X_{i})}{\partial θ^{T}}) C_{τ} (θ) \\ - τ C_{τ} (θ) c_{τ} (θ) τ f_{θ} {(X_{i})}^{τ} u_{θ} {(X_{i})}^{T}]\} \\ - \frac{1}{C_{τ} {(θ)}^{2}} \{\frac{1}{n} \sum_{i = 1}^{n} [(τ \frac{\partial c_{τ} (θ)}{\partial θ^{T}} f_{θ} {(X_{i})}^{τ} + τ^{2} f_{θ} {(X_{i})}^{τ} c_{τ} (θ) u_{θ} {(X_{i})}^{T}) C_{τ} (θ) \\ - τ C_{τ} (θ) c_{τ} (θ) τ c_{τ} {(θ)}^{T} f_{θ} {(X_{i})}^{τ}]\} \\ = & \frac{1}{C_{τ} (θ)} \{\frac{1}{n} \sum_{i = 1}^{n} [τ^{2} f_{θ} {(X_{i})}^{τ} u_{θ} (X_{i}) u_{θ} {(X_{i})}^{T} + τ f_{θ} {(X_{i})}^{τ} \frac{\partial u_{θ} (X_{i})}{\partial θ^{T}} \\ - τ^{2} c_{τ} {(θ)}^{T} f_{θ} {(X_{i})}^{τ} u_{θ} {(X_{i})}^{T} - τ \frac{\partial c_{τ} (θ)}{\partial θ^{T}} f_{θ} {(X_{i})}^{τ} \\ - τ^{2} c_{τ} (θ) f_{θ} {(X_{i})}^{τ} u_{θ} {(X_{i})}^{T} - τ^{2} c_{τ} (θ) c_{τ} {(θ)}^{T} f_{θ} {(X_{i})}^{τ}]\} . \end{matrix}$

As $n \to \infty$ , we have ${(\frac{\partial^{2} h_{n} (θ)}{\partial θ \partial θ^{T}})}_{θ = θ_{0}} \underset{n \to \infty}{\overset{P}{\to}} T (θ_{0})$ with $T (θ_{0})$ being the matrix given by $\begin{matrix} T (θ_{0}) & = & \frac{1}{C_{τ} (θ)} \{τ^{2} \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x + τ \int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} d x \\ - τ^{2} c_{τ} (θ) \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x - τ \frac{\partial c_{τ} (θ)}{\partial θ^{T}} \int f_{θ} {(x)}^{τ + 1} d x \\ + τ^{2} c_{τ} (θ) \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x - τ^{2} c_{τ} (θ) c_{τ} {(θ)}^{T} \int f_{θ} {(x)}^{τ + 1} d x\} . \end{matrix}$

From the above, after some algebra, we obtain $\begin{matrix} T (θ_{0}) & = & \frac{1}{C_{τ} (θ)} \{τ^{2} \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x + τ \int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} d x \\ - τ \frac{\partial c_{τ} (θ)}{\partial θ^{T}} \int f_{θ} {(x)}^{τ + 1} d x - τ^{2} c_{τ} {(θ)}^{T} c_{τ} (θ) \int f_{θ} {(x)}^{τ + 1} d x\} . \end{matrix}$

On the other hand, it is not difficult to establish that $\begin{matrix} \frac{\partial c_{τ} (θ)}{\partial θ^{T}} & = & (τ + 1) \frac{\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x}{\int f_{θ} {(x)}^{τ + 1} d x} + \frac{\int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} d x}{\int f_{θ} {(x)}^{τ + 1} d x} \\ & & - (τ + 1) \frac{\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x}{{(\int f_{θ} {(x)}^{τ + 1} d x)}^{2}} . \end{matrix}$

Therefore we have $\begin{matrix} - τ \frac{\partial c_{τ} (θ)}{\partial θ^{T}} \int f_{θ} {(x)}^{τ + 1} d x & = & - τ (τ + 1) \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x - τ \int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} \\ + τ (τ + 1) \frac{\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x}{\int f_{θ} {(x)}^{τ + 1} d x} . \end{matrix}$

Finally, $\begin{matrix} T (θ_{0}) & = & \frac{1}{C_{τ} (θ)} \{τ^{2} \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x + τ \int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} d x \\ & & - τ (τ + 1) \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x - τ \int f_{θ} {(x)}^{τ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}} d x \\ & & + τ (τ + 1) \frac{\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x}{\int f_{θ} {(x)}^{τ + 1} d x} - τ^{2} c_{τ} (θ) c_{τ} {(θ)}^{T} \int f_{θ} {(x)}^{τ + 1} d x\} \\ & = & \frac{1}{C_{τ} (θ)} \{- τ \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x + τ \frac{\int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x \int f_{θ} {(x)}^{τ + 1} u_{θ} {(x)}^{T} d x}{\int f_{θ} {(x)}^{τ + 1} d x}\} \\ & = & - \frac{τ}{C_{τ} (θ)} S_{τ} (θ_{0}) . \end{matrix}$

On the other hand, $\sqrt{n} \frac{\partial h_{n} (θ)}{\partial θ} = \frac{τ}{C_{τ} (θ)} \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} f_{θ} {(X_{i})}^{τ} (u_{θ} (X_{i}) - c_{τ} (θ)) \underset{n \to \infty}{\overset{L}{\to}} N (0_{p}, {(\frac{τ}{C_{τ} (θ)})}^{2} K_{τ} (θ_{0})),$ as $E [{(\frac{τ}{C_{τ} (θ)} f_{θ} {(X)}^{τ} (u_{θ} (X) - c_{τ} (θ)))}_{θ = θ_{0}}] = 0_{p}$ and $\mathrm{Cov} [{(\frac{τ}{C_{τ} (θ)} f_{θ} {(X)}^{τ} (u_{θ} (X) - c_{τ} (θ)))}_{θ = θ_{0}}] = {(\frac{τ}{C_{τ} (θ_{0})})}^{2} K_{τ} (θ_{0}) .$

Then, the RMRPE estimator of $θ$ , ${\tilde{θ}}_{τ}$ , must satisfy (A1) $\{\begin{matrix} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = {\tilde{θ}}_{τ}} + G ({\tilde{θ}}_{τ}) λ_{n} = 0_{p}, \\ g ({\tilde{θ}}_{τ}) = 0_{r}, \end{matrix}$ where $λ_{n}$ is a vector of Lagrangian multipliers. Now we consider $θ_{n} = θ_{0} + m n^{- 1 / 2}$ , with $| | m | | < k$ , for $0 < k < \infty$ . We have, $\frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{n}} = \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} + \frac{\partial}{\partial θ^{T}} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} (θ_{n} - θ_{0}) + o (| | θ_{n} - θ_{0} {| |}^{2})$ and (A2) $n^{1 / 2} {\frac{\partial}{\partial θ} h_{n} (θ)|}_{θ = θ_{n}} = n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} - \frac{\partial}{\partial θ^{T}} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} n^{1 / 2} (θ_{n} - θ_{0}) + o (n^{1 / 2} | | θ_{n} - θ_{0} {| |}^{2}) .$

However, $o (n^{1 / 2} | | θ_{n} - θ_{0} {| |}^{2}) = o (n^{1 / 2} {| | m | |}^{2} / n) = o (n^{- 1 / 2} {| | m | |}^{2}) = o (O_{p} (1)) = o_{p} (1) .$

Since $lim_{n \to \infty} \frac{\partial}{\partial θ^{T}} \frac{\partial}{\partial θ} h_{n} {(θ) |}_{θ = θ_{0}} = - \frac{τ}{C_{τ} (θ)} S_{τ} (θ_{0})$ we obtain (A3) $n^{1 / 2} {\frac{\partial}{\partial θ} h_{n} (θ)|}_{θ = θ_{n}} = n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} + \frac{τ}{C_{τ} (θ)} S_{τ} (θ_{0}) n^{1 / 2} (θ_{n} - θ_{0}) + o_{p} (1) .$

Now, we know that (A4) $n^{1 / 2} g (θ_{n}) = G {(θ_{0})}^{T} n^{1 / 2} (θ_{n} - θ_{0}) + o_{p} (1) .$

Further, the RMRPE ${\tilde{θ}}_{τ}$ must satisfy the conditions in (A1), and in view of (A3) and (A4) we have (A5) $n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} + \frac{τ}{C_{τ} (θ)} S_{τ} (θ_{0}) n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) + G (θ_{0}) n^{1 / 2} λ_{n} + o_{p} (1) = 0_{p} .$

From (A4) it follows that (A6) $G {(θ_{0})}^{T} n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) + o_{p} (1) = 0_{r} .$

Now we can express Equations (A5) and (A6) in matrix form as $(\begin{matrix} \frac{τ}{C_{τ} (θ_{0})} S_{τ} (θ_{0}) & G (θ_{0}) \\ G {(θ_{0})}^{T} & 0_{r \times r} \end{matrix}) (\begin{matrix} n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) \\ n^{1 / 2} λ_{n} \end{matrix}) = (\begin{matrix} - n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} \\ 0_{r} \end{matrix}) + o_{p} (1) .$

Therefore $(\begin{matrix} n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) \\ n^{1 / 2} λ_{n} \end{matrix}) = {(\begin{matrix} \frac{τ}{C_{τ} (θ_{0})} S_{τ} (θ_{0}) & G (θ_{0}) \\ G {(θ_{0})}^{T} & 0_{r \times r} \end{matrix})}^{- 1} (\begin{matrix} - n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} \\ 0_{r} \end{matrix}) + o_{p} (1) .$

However, ${(\begin{matrix} \frac{τ}{C_{τ} (θ_{0})} S_{τ} (θ_{0}) & G (θ_{0}) \\ G {(θ_{0})}^{T} & 0 \end{matrix})}^{- 1} = (\begin{matrix} {(\frac{τ}{C_{τ} (θ_{0})})}^{- 1} P_{τ}^{*} (θ_{0}) & Q_{τ} (θ_{0}) \\ Q_{τ} {(θ_{0})}^{T} & R_{τ} (θ_{0}) \end{matrix}),$ where $P_{τ}^{*} (θ_{0})$ and $Q_{τ} (θ_{0})$ are defined in (18) and (19), respectively. The matrix $R_{τ} (θ_{0})$ is the quantity needed to make the right hand side of the above equation equal to the indicated inverse. Then, (A7) $n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) = - {(\frac{τ}{C_{τ} (θ)})}^{- 1} P_{τ}^{*} (θ_{0}) n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} + o_{p} (1),$ and we know (A8) $n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} \underset{n ⟶ \infty}{\overset{L}{⟶}} N (0, {(\frac{τ}{C_{τ} (θ_{0})})}^{2} K_{τ} (θ_{0})) .$

Now by (A7) and (A8), we have the desired result.

Appendix A.2. Proof of Theorem 3

Consider the expression $R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})$ . A Taylor expansion for an arbitrary $θ \in Θ$ , around ${\tilde{θ}}_{τ}$ leads to the relation $\begin{matrix} R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}}) & = R_{γ} (f_{{\tilde{θ}}_{τ}}, f_{{\tilde{θ}}_{τ}}) + {(\frac{\partial R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})}{\partial θ})}_{θ = {\tilde{θ}}_{τ}} (θ - {\tilde{θ}}_{τ}) \\ + \frac{1}{2} (θ - {\tilde{θ}}_{τ}) {(\frac{\partial^{2} R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})}{\partial θ \partial θ^{T}})}_{θ = {\tilde{θ}}_{τ}} {(θ - {\tilde{θ}}_{τ})}^{T} + o ({∥θ - {\tilde{θ}}_{τ}∥}^{2}) . \end{matrix}$

It is clear that $R_{γ} (f_{{\tilde{θ}}_{τ}}, f_{{\tilde{θ}}_{τ}}) = 0$ and $\frac{\partial R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})}{\partial θ} = \frac{\partial L_{γ}^{1} (θ)}{\partial θ} - \frac{\partial L_{γ}^{2} (θ)}{\partial θ},$ being $L_{γ}^{1} (θ) = \frac{1}{γ + 1} log (\int f_{θ} {(x)}^{γ + 1} d x)$ and $L_{γ}^{2} (θ) = \frac{1}{γ} log (\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) d x) .$

Then, $\frac{\partial L_{γ}^{1} (θ)}{\partial θ} = \frac{\int f_{θ} {(x)}^{γ + 1} u_{θ} (x) d x}{\int f_{θ} {(x)}^{γ + 1} d x}$ and $\frac{\partial L_{γ}^{2} (θ)}{\partial θ} = \frac{\int f_{θ} {(x)}^{γ} u_{θ} (x) f_{{\tilde{θ}}_{τ}} (x) d x}{\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) d x} .$

Therefore, ${(\frac{\partial R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})}{\partial θ})}_{θ = {\tilde{θ}}_{τ}} = 0 .$

Regarding the second derivatives, we have $\begin{matrix} \frac{\partial^{2} L_{γ}^{1} (θ)}{\partial θ \partial θ^{T}} & = & (γ + 1) \frac{\int f_{θ} {(x)}^{γ + 1} u_{θ} (x) u_{θ} {(x)}^{T} d x}{\int f_{θ} {(x)}^{γ + 1} d x} + \frac{\int f_{θ} {(x)}^{γ + 1} \frac{\partial u_{θ} (x)}{\partial θ^{T}}}{\int f_{θ} {(x)}^{γ + 1} d x} \\ - (γ + 1) \frac{\int f_{θ} {(x)}^{γ + 1} u_{θ} (x) d x \int f_{θ} {(x)}^{γ + 1} u_{θ} {(x)}^{T} d x}{{(\int f_{θ} {(x)}^{γ + 1} d x)}^{2}} \end{matrix}$ and $\begin{matrix} \frac{\partial^{2} L_{γ}^{2} (θ)}{\partial θ \partial θ^{T}} & = & γ \frac{\int f_{θ} {(x)}^{γ} u_{θ} (x) u_{θ} {(x)}^{T} f_{{\tilde{θ}}_{τ}} (x) d x}{\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) d x} + \frac{\int f_{θ} {(x)}^{γ} \frac{\partial u_{θ} (x)}{\partial θ^{T}} f_{{\tilde{θ}}_{τ}} (x) d x}{\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) d x} \\ - γ \frac{\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) u_{θ} (x) d x \int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) u_{θ} {(x)}^{T} d x}{{(\int f_{θ} {(x)}^{γ} f_{{\tilde{θ}}_{τ}} (x) d x)}^{2}} . \end{matrix}$ and so ${(\frac{\partial^{2} R_{γ} (f_{θ}, f_{{\tilde{θ}}_{τ}})}{\partial θ \partial θ^{T}})}_{θ = {\tilde{θ}}_{τ}} = \frac{S_{γ} ({\tilde{θ}}_{τ})}{κ_{γ} ({\tilde{θ}}_{τ})}$

Therefore, $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ}) = 2 n R_{γ} (f_{{\hat{θ}}_{τ}}, f_{{\tilde{θ}}_{τ}}) = n^{1 / 2} {({\hat{θ}}_{τ} - {\tilde{θ}}_{τ})}^{T} \frac{S_{γ} ({\tilde{θ}}_{τ})}{κ_{γ} ({\tilde{θ}}_{τ})} n^{1 / 2} ({\hat{θ}}_{τ} - {\tilde{θ}}_{τ}) + n \times o ({∥{\hat{θ}}_{τ} - {\tilde{θ}}_{τ}∥}^{2}) .$

Under $θ_{0} \in Θ_{0}$ , $\frac{S_{γ} ({\tilde{θ}}_{τ})}{κ_{γ} ({\tilde{θ}}_{τ})} \underset{n ⟶ \infty}{\overset{P}{⟶}} \frac{S_{γ} (θ_{0})}{κ_{γ} (θ_{0})} .$

Based on ${\hat{θ}}_{τ}$ and using (A4) and (A5), we have that $n^{1 / 2} \frac{\partial}{\partial θ} {h_{n} (θ)|}_{θ = θ_{0}} = - \frac{τ}{C_{τ} (θ_{0})} n^{1 / 2} S_{τ} (θ_{0}) ({\hat{θ}}_{τ} - θ_{0}) + o_{p} (1),$ and using (A7), we obtain $\begin{matrix} n^{1 / 2} ({\tilde{θ}}_{τ} - θ_{0}) & = & P_{τ}^{*} (θ_{0}) n^{1 / 2} S_{τ} (θ_{0}) ({\hat{θ}}_{τ} - θ_{0}) + o_{p} (1) \\ & = & n^{1 / 2} ({\hat{θ}}_{τ} - θ_{0}) - Q_{τ} (θ_{0}) G {(θ_{0})}^{T} n^{1 / 2} ({\hat{θ}}_{τ} - θ_{0}) + o_{p} (1) . \end{matrix}$

Therefore, (A9) $n^{1 / 2} ({\hat{θ}}_{τ} - {\tilde{θ}}_{τ}) = Q_{τ} (θ_{0}) G {(θ_{0})}^{T} n^{1 / 2} ({\hat{θ}}_{τ} - θ_{0}) + o_{p} (1) .$

On the other hand, we know that $n^{1 / 2} ({\hat{θ}}_{τ} - θ_{0}) \underset{n ⟶ \infty}{\overset{L}{⟶}} N (0, S_{τ} {(θ_{0})}^{- 1} K_{τ} (θ_{0}) S_{τ} {(θ_{0})}^{- 1}) .$

From Equations (19) and (25), we can establish that $\begin{matrix} B_{τ} (θ_{0}) & = & S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} \\ & = & Q_{τ} (θ_{0}) G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} . \end{matrix}$

Therefore, it follows that $n^{1 / 2} ({\hat{θ}}_{τ} - {\tilde{θ}}_{τ}) \underset{n ⟶ \infty}{\overset{L}{⟶}} N (0, B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} {(θ_{0})}^{T}) .$

Now, observe from the definition that $B_{τ} (θ_{0}) = B_{τ} {(θ_{0})}^{T} .$

Then, the asymptotic distribution of the random variables $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ}) = 2 n R_{γ} (f_{{\hat{θ}}_{τ}}, f_{{\tilde{θ}}_{τ}})$ and $n^{1 / 2} {({\hat{θ}}_{τ} - {\tilde{θ}}_{τ})}^{T} \frac{S_{γ} (θ_{0})}{κ_{γ} (θ_{0})} n^{1 / 2} ({\hat{θ}}_{τ} - {\tilde{θ}}_{τ})$ are the same, as we have established that $n \times o ({∥{\hat{θ}}_{τ} - {\tilde{θ}}_{τ}∥}^{2}) = o_{p} (1) .$

Next, we apply Corollary 2.1 in Dik and Gunst [30], which states: “Let $X$ be a $q$-variate normal random variable with mean vector $0$ and variance-covariance matrix $Σ$ . Let $M$ be a real symmetric matrix of order $q$. Let $k = \mathrm{rank} (Σ M Σ)$ , $k \geq 1$ and let $λ_{1}, \dots, λ_{k}$ be the nonzero eigenvalues of $M Σ .$ Then, the distribution of the quadratic form $X^{T} M X$ coincides with the distribution of the random variable $\sum_{i = 1}^{k} λ_{i} Z_{i}^{2},$ where $Z_{1}, \dots, Z_{k}$ are independent, each having a standard normal distribution”. In our case, the asymptotic distribution of $T_{γ} ({\hat{θ}}_{τ}, {\tilde{θ}}_{τ})$ coincides with the distribution of the random variable $\sum_{i = 1}^{k} λ_{i}^{τ, γ} (θ_{0}) Z_{i}^{2},$ where $λ_{1}^{τ, γ} (θ_{0}), \dots, λ_{k}^{τ, γ} (θ_{0})$ are the nonzero eigenvalues of $A_{γ} (θ_{0}) B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0})$ and (A10) $k = \min \{r, \mathrm{rank} (B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0}) A_{γ} (θ_{0}) B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0}))\} .$
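The quoted result on quadratic forms can be illustrated numerically. The Python sketch below uses two small matrices chosen arbitrarily for illustration (they are assumptions, not quantities from this paper), computes the eigenvalues of $M Σ$, and compares the first two moments of simulated values of $X^{T} M X$ with those of $\sum_{i} λ_{i} Z_{i}^{2}$, namely $\sum_{i} λ_{i}$ and $2 \sum_{i} λ_{i}^{2}$. Matching two moments is only a sanity check and does not verify the full equality in distribution.

```python
import math
import random

# Illustrative 2x2 matrices (assumed values, not taken from the paper).
SIGMA = [[2.0, 0.6], [0.6, 1.0]]  # covariance matrix of X
M = [[1.0, 0.5], [0.5, 3.0]]      # symmetric matrix of the quadratic form


def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def eigenvalues2(a):
    """Eigenvalues of a 2x2 matrix with real spectrum (characteristic polynomial)."""
    trace = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = math.sqrt(trace ** 2 - 4.0 * det)
    return (trace + root) / 2.0, (trace - root) / 2.0


lam1, lam2 = eigenvalues2(matmul2(M, SIGMA))

# Cholesky factor of SIGMA, used to draw X = L Z with Z standard normal.
l11 = math.sqrt(SIGMA[0][0])
l21 = SIGMA[1][0] / l11
l22 = math.sqrt(SIGMA[1][1] - l21 ** 2)

rng = random.Random(7)  # fixed seed so the sketch is reproducible
draws = []
for _ in range(100_000):
    z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
    x1, x2 = l11 * z1, l21 * z1 + l22 * z2
    draws.append(M[0][0] * x1 * x1 + 2.0 * M[0][1] * x1 * x2 + M[1][1] * x2 * x2)

mc_mean = sum(draws) / len(draws)
mc_var = sum((d - mc_mean) ** 2 for d in draws) / len(draws)
theory_mean = lam1 + lam2                    # mean of sum_i lam_i Z_i^2
theory_var = 2.0 * (lam1 ** 2 + lam2 ** 2)   # variance of sum_i lam_i Z_i^2
print(f"mean: simulated {mc_mean:.3f}, eigenvalue-based {theory_mean:.3f}")
print(f"variance: simulated {mc_var:.3f}, eigenvalue-based {theory_var:.3f}")
```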

We now establish that $k = r$ . The matrix, $N_{τ} (θ_{0}) = B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0})$ is given by $\begin{matrix} N_{τ} (θ_{0}) & = & S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} \\ K_{τ} (θ_{0}) S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} . \end{matrix}$

Corollary 14.11.3 in Harville [31] (p. 259) establishes the following: “For any $m \times n$ matrix $A$ and any $m \times m$ symmetric positive definite matrix $W$ , $\mathrm{rank} (A^{T} W A) = \mathrm{rank} (A)$”. Based on this corollary, we have that $\mathrm{rank} (N_{τ} (θ_{0}))$ coincides with $\mathrm{rank} (N_{τ} (θ_{0}) S_{γ} (θ_{0}) N_{τ} (θ_{0})) .$

On the other hand, we know the following additional properties:

(a) $\mathrm{rank} (AB) = \mathrm{rank} (A)$ if $B$ is full rank (Corollary b.3.3 in Harville [31] (p. 83)).

(b) $\mathrm{rank} (AB) = \mathrm{rank} (BA)$ if the dimension of $A$ coincides with the dimension of $B^{T} .$

The matrix $K_{τ} (θ_{0})$ must be of full rank; otherwise, the variance–covariance matrices of ${\hat{θ}}_{τ}$ and ${\tilde{θ}}_{τ}$ would not be of full rank, which would mean that $θ$ contains redundant components, and this is not the case.

Therefore, we have $\begin{matrix} \mathrm{rank} (N_{τ} (θ_{0})) & \underset{(a)}{=} & \mathrm{rank} (S_{τ} {(θ_{0})}^{- \frac{1}{2}} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} K_{τ} (θ_{0}) S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) \\ & & \times {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- \frac{1}{2}}) \\ & \underset{(b)}{=} & \mathrm{rank} (G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} K_{τ} (θ_{0}) S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1}) \\ & \underset{(a)}{=} & \mathrm{rank} (G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} K_{τ} (θ_{0}) S_{τ} {(θ_{0})}^{- 1} G (θ_{0})) \\ & \underset{\text{Corollary 14.11.3}}{=} & \mathrm{rank} (S_{τ} {(θ_{0})}^{- 1} G (θ_{0})) \\ & \underset{(a)}{=} & \mathrm{rank} (G (θ_{0})) = r . \end{matrix}$

Appendix A.3. Rényi’s Pseudodistance between Normal Populations

Here, we compute the expression of the RP between densities belonging to the normal model with parameters $(μ_{1}, σ_{1})$ and $(μ_{2}, σ_{2}),$ respectively. The RP between $N (μ_{1}, σ_{1})$ and $N (μ_{2}, σ_{2})$ is given by $\begin{matrix} R_{γ} (N (μ_{1}, σ_{1}), N (μ_{2}, σ_{2})) & = & \frac{1}{γ + 1} log \int N {(μ_{1}, σ_{1})}^{γ + 1} d x \\ + \frac{1}{γ (γ + 1)} log \int N {(μ_{2}, σ_{2})}^{γ + 1} d x - \frac{1}{γ} log \int N {(μ_{1}, σ_{1})}^{γ} N (μ_{2}, σ_{2}) d x \\ = & \frac{1}{γ + 1} log L_{1} + \frac{1}{γ (γ + 1)} log L_{2} - \frac{1}{γ} log L_{3} . \end{matrix}$

We first compute $\int N {(μ, σ)}^{β} d x$ for the sake of simplicity in later calculations. $\begin{matrix} \int N {(μ, σ)}^{β} d x & = & \int {(\frac{1}{σ \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ}{σ})}^{2}})}^{β} d x \\ & = & \frac{1}{σ^{β - 1} {(\sqrt{2 π})}^{β - 1}} \frac{1}{\sqrt{β}} \int \frac{1}{\frac{σ}{\sqrt{β}} \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ}{\frac{σ}{\sqrt{β}}})}^{2}} d x \\ & = & \frac{1}{σ^{β - 1} {(\sqrt{2 π})}^{β - 1}} \frac{1}{\sqrt{β}} . \end{matrix}$ Therefore, $L_{1} = \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{\sqrt{γ + 1}}$ and $L_{2} = \frac{1}{σ_{2}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{\sqrt{γ + 1}} .$

In relation with $L_{3}$ we have, $\begin{matrix} L_{3} & = & \int N {(μ_{1}, σ_{1})}^{γ} N (μ_{2}, σ_{2}) d x \\ = & \int \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} e^{- \frac{1}{2} {(\frac{x - μ_{1}}{\frac{σ_{1}}{\sqrt{γ}}})}^{2}} \frac{1}{σ_{2} \sqrt{2 π}} e^{- \frac{1}{2} {(\frac{x - μ_{2}}{σ_{2}})}^{2}} \\ = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2} \sqrt{2 π}} \times \\ \times \int exp \{- \frac{1}{2} [x^{2} (\frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}}) - 2 x (\frac{μ_{1}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}}{σ_{2}^{2}}) + \frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} d x \\ = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2} \sqrt{2 π}} exp \{- \frac{1}{2} [\frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} \times \\ \times \int exp \{- \frac{1}{2} [x^{2} (\frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}}) - 2 x (\frac{μ_{1}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}}{σ_{2}^{2}})]\} d x \\ = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2} \sqrt{2 π}} exp \{- \frac{1}{2} [\frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} exp \{\frac{1}{2} \frac{A^{2}}{B^{2}}\} B \sqrt{2 π} \times \\ \times \int \frac{1}{\sqrt{2 π} B} exp \{- \frac{1}{2} {(\frac{x - A}{B})}^{2}\} d x \\ = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2} \sqrt{2 π}} exp \{- \frac{1}{2} [\frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} exp \{\frac{1}{2} \frac{A^{2}}{B^{2}}\} B \sqrt{2 π} . \end{matrix}$

It remains to obtain $A$ and $B$, which are determined by $\{\begin{matrix} \frac{1}{B^{2}} = \frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}} \\ \frac{A}{B^{2}} = (\frac{μ_{1}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}}{σ_{2}^{2}}) \end{matrix} .$

Then, $A (\frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}}) = \frac{μ_{1}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}}{σ_{2}^{2}}$ and $A = \frac{\frac{μ_{1}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}}{σ_{2}^{2}}}{\frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}}} = \frac{\frac{σ_{2}^{2} μ_{1} + μ_{2} {(\frac{σ_{1}}{\sqrt{γ}})}^{2}}{σ_{2}^{2} {(\frac{σ_{1}}{\sqrt{γ}})}^{2}}}{\frac{σ_{2}^{2} + {(\frac{σ_{1}}{\sqrt{γ}})}^{2}}{σ_{2}^{2} {(\frac{σ_{1}}{\sqrt{γ}})}^{2}}} = \frac{σ_{2}^{2} μ_{1} + μ_{2} \frac{σ_{1}^{2}}{γ}}{σ_{2}^{2} + \frac{σ_{1}^{2}}{γ}} = \frac{γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2}}{γ σ_{2}^{2} + σ_{1}^{2}} .$

We have, $\frac{1}{B^{2}} = \frac{1}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{1}{σ_{2}^{2}} = \frac{γ}{σ_{1}^{2}} + \frac{1}{σ_{2}^{2}} = \frac{σ_{2}^{2} γ + σ_{1}^{2}}{σ_{1}^{2} σ_{2}^{2}}$

Therefore, $B = \frac{σ_{1} σ_{2}}{\sqrt{σ_{2}^{2} γ + σ_{1}^{2}}} .$

On the other hand, $\frac{A^{2}}{B^{2}} = {(\frac{γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2}}{γ σ_{2}^{2} + σ_{1}^{2}})}^{2} \frac{σ_{2}^{2} γ + σ_{1}^{2}}{σ_{1}^{2} σ_{2}^{2}} = \frac{{(γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}}$ and $\begin{matrix} L_{3} & = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2} \sqrt{2 π}} exp \{- \frac{1}{2} [\frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} exp \{\frac{1}{2} \frac{A^{2}}{B^{2}}\} B \sqrt{2 π} \\ = & \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2}} exp \{- \frac{1}{2} [\frac{μ_{1}^{2}}{{(\frac{σ_{1}}{\sqrt{γ}})}^{2}} + \frac{μ_{2}^{2}}{σ_{2}^{2}}]\} exp \{\frac{1}{2} \frac{{(γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}}\} \frac{σ_{1} σ_{2}}{\sqrt{σ_{2}^{2} γ + σ_{1}^{2}}} \\ = & \frac{σ_{1} σ_{2}}{\sqrt{σ_{2}^{2} γ + σ_{1}^{2}}} \frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{σ_{2}} exp \{\frac{1}{2} [\frac{{(γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} - \frac{γ μ_{1}^{2} σ_{2}^{2} + σ_{1}^{2} μ_{2}^{2}}{σ_{2}^{2} σ_{1}^{2}}]\} . \end{matrix}$

However, $\begin{matrix} \frac{{(γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} - \frac{γ μ_{1}^{2} σ_{2}^{2} + σ_{1}^{2} μ_{2}^{2}}{σ_{2}^{2} σ_{1}^{2}} & = & \frac{{(γ σ_{2}^{2} μ_{1} + μ_{2} σ_{1}^{2})}^{2} - (γ μ_{1}^{2} σ_{2}^{2} + σ_{1}^{2} μ_{2}^{2}) (γ σ_{2}^{2} + σ_{1}^{2})}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} \\ = & \frac{γ^{2} σ_{2}^{4} μ_{1}^{2} + μ_{2}^{2} σ_{1}^{4} + 2 γ σ_{2}^{2} μ_{1} μ_{2} σ_{1}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} \\ - \frac{γ^{2} μ_{1}^{2} σ_{2}^{4} + γ μ_{1}^{2} σ_{2}^{2} σ_{1}^{2} + μ_{2}^{2} γ σ_{2}^{2} σ_{1}^{2} + μ_{2}^{2} σ_{1}^{4}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} \\ = & \frac{2 γ σ_{2}^{2} μ_{1} μ_{2} σ_{1}^{2} - γ μ_{1}^{2} σ_{2}^{2} σ_{1}^{2} - μ_{2}^{2} γ σ_{2}^{2} σ_{1}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} \\ = & \frac{σ_{2}^{2} σ_{1}^{2} γ (2 μ_{1} μ_{2} - μ_{1}^{2} - μ_{2}^{2})}{(γ σ_{2}^{2} + σ_{1}^{2}) σ_{1}^{2} σ_{2}^{2}} \\ = & - \frac{γ {(μ_{1} - μ_{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2})} \end{matrix}$

Therefore, $L_{3} = \frac{1}{σ_{1}^{γ - 1} \sqrt{σ_{2}^{2} γ + σ_{1}^{2}}} \frac{1}{{(\sqrt{2 π})}^{γ}} exp \{- \frac{1}{2} \frac{γ {(μ_{1} - μ_{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2})}\} .$

Then, $\begin{matrix} R_{γ} (N (μ_{1}, σ_{1}), N (μ_{2}, σ_{2})) & = & \frac{1}{γ + 1} \ln L_{1} + \frac{1}{γ (γ + 1)} \ln L_{2} - \frac{1}{γ} \ln L_{3} \\ & = & \frac{1}{γ + 1} \ln [\frac{1}{σ_{1}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{\sqrt{γ + 1}}] + \frac{1}{γ (γ + 1)} \ln [\frac{1}{σ_{2}^{γ} {(\sqrt{2 π})}^{γ}} \frac{1}{\sqrt{γ + 1}}] \\ & & - \frac{1}{γ} \ln [\frac{1}{σ_{1}^{γ - 1} \sqrt{σ_{2}^{2} γ + σ_{1}^{2}}} \frac{1}{{(\sqrt{2 π})}^{γ}}] + \frac{1}{2} \frac{γ {(μ_{1} - μ_{2})}^{2}}{γ (γ σ_{2}^{2} + σ_{1}^{2})} \\ & = & \frac{1}{γ (γ + 1)} (\ln [\frac{σ_{1}^{γ - 1}}{σ_{2}^{γ} \sqrt{γ + 1}} \sqrt{σ_{1}^{2} + γ σ_{2}^{2}}] + γ \ln [\frac{1}{σ_{1} \sqrt{γ + 1}} \sqrt{σ_{1}^{2} + γ σ_{2}^{2}}]) \\ & & + \frac{1}{2} \frac{γ {(μ_{1} - μ_{2})}^{2}}{γ (γ σ_{2}^{2} + σ_{1}^{2})} \\ & = & \frac{1}{γ (γ + 1)} \ln [\frac{1}{σ_{1} σ_{2}^{γ}} {(\frac{\sqrt{σ_{1}^{2} + γ σ_{2}^{2}}}{\sqrt{γ + 1}})}^{γ + 1}] + \frac{1}{2} \frac{{(μ_{1} - μ_{2})}^{2}}{(γ σ_{2}^{2} + σ_{1}^{2})} . \end{matrix}$

Letting $γ \to 0,$ we obtain (A11) $\lim_{γ \to 0} R_{γ} (N (μ_{1}, σ_{1}), N (μ_{2}, σ_{2})) = \frac{σ_{2}^{2} - σ_{1}^{2}}{2 σ_{1}^{2}} + \ln \frac{σ_{1}}{σ_{2}} + \frac{1}{2} \frac{{(μ_{1} - μ_{2})}^{2}}{σ_{1}^{2}},$ which is the Kullback-Leibler divergence of $N (μ_{2}, σ_{2})$ from $N (μ_{1}, σ_{1}) .$
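The closed-form expression for $R_{γ}$ and its limit (A11) can be checked numerically. The following is a minimal sketch, not part of the original derivation: the function names (`rp_closed_form`, `rp_numeric`, `kl_limit`, `simpson`) are hypothetical, the integrals $L_{1}, L_{2}, L_{3}$ are approximated with a composite Simpson rule on a truncated range, and $σ_{1}, σ_{2}$ are treated as standard deviations.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of a normal law with mean mu and standard deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def simpson(fn, lo, hi, n=4000):
    """Composite Simpson rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = fn(lo) + fn(hi)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * fn(lo + i * h)
    return total * h / 3.0


def rp_closed_form(mu1, s1, mu2, s2, gamma):
    """Final line of the derivation of R_gamma between two normal laws."""
    v = s1 ** 2 + gamma * s2 ** 2
    log_part = (-math.log(s1) - gamma * math.log(s2)
                + 0.5 * (gamma + 1.0) * math.log(v / (gamma + 1.0)))
    return log_part / (gamma * (gamma + 1.0)) + 0.5 * (mu1 - mu2) ** 2 / v


def rp_numeric(mu1, s1, mu2, s2, gamma):
    """R_gamma from its definition, with L1, L2, L3 integrated numerically."""
    width = 12.0 * max(s1, s2)  # truncation range: an assumption of this sketch
    lo, hi = min(mu1, mu2) - width, max(mu1, mu2) + width
    f1 = lambda x: normal_pdf(x, mu1, s1)
    f2 = lambda x: normal_pdf(x, mu2, s2)
    L1 = simpson(lambda x: f1(x) ** (gamma + 1.0), lo, hi)
    L2 = simpson(lambda x: f2(x) ** (gamma + 1.0), lo, hi)
    L3 = simpson(lambda x: f1(x) ** gamma * f2(x), lo, hi)
    return (math.log(L1) / (gamma + 1.0)
            + math.log(L2) / (gamma * (gamma + 1.0))
            - math.log(L3) / gamma)


def kl_limit(mu1, s1, mu2, s2):
    """Right-hand side of (A11), the limit of R_gamma as gamma -> 0."""
    return ((s2 ** 2 - s1 ** 2) / (2.0 * s1 ** 2) + math.log(s1 / s2)
            + 0.5 * (mu1 - mu2) ** 2 / s1 ** 2)
```

For a moderate value such as $γ = 0.5,$ `rp_closed_form` and `rp_numeric` should agree up to quadrature error, and for a very small $γ$ the closed form should approach `kl_limit`.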

Appendix A.4. Computation of the Nonzero Eigenvalues of A_γ(θ₀)B_τ(θ₀)K_τ(θ₀)B_τ(θ₀)

We know that the vector $ξ (θ)$ can be expressed as $ξ (θ) = c_{τ} (θ) κ (θ)$ with $κ (θ) = \int f_{θ} {(x)}^{τ + 1} d x = \frac{1}{σ^{τ} {(\sqrt{2 π})}^{τ} \sqrt{1 + τ}} .$

Then, $ξ (θ) = \frac{1}{σ^{τ} {(\sqrt{2 π})}^{τ} \sqrt{1 + τ}} {(0, - \frac{τ}{(τ + 1)} \frac{1}{σ})}^{T} .$

Therefore, $c_{τ} (θ) = \frac{ξ (θ)}{κ (θ)} = {(0, - \frac{τ}{(τ + 1)} \frac{1}{σ})}^{T} .$

On the other hand, $\frac{\partial \log f_{μ, σ} (X_{i})}{\partial μ} = \frac{X_{i} - μ}{σ^{2}}$ and $\frac{\partial \log f_{μ, σ} (X_{i})}{\partial σ} = - \frac{1}{σ} + \frac{1}{σ^{3}} {(X_{i} - μ)}^{2},$ so that $u_{θ} (X_{i}) = {(\frac{X_{i} - μ}{σ^{2}}, - \frac{1}{σ} + \frac{1}{σ^{3}} {(X_{i} - μ)}^{2})}^{T} .$

Then, $Ψ_{τ} (X; θ) = (Ψ_{τ}^{1} (X; θ), Ψ_{τ}^{2} (X; θ))$ is given by $Ψ_{τ} (X; θ) = (\frac{X - μ}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}}, ({(\frac{X - μ}{σ})}^{2} - \frac{1}{1 + τ}) \frac{1}{σ} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}})$ and $K_{τ} (θ) = E [Ψ_{τ} (X; θ) Ψ_{τ} {(X; θ)}^{T}] .$

Now we obtain the elements of that matrix, $\begin{matrix} K_{τ}^{11} (θ) & = & E [{(\frac{X - μ}{σ^{2}})}^{2} \frac{1}{{(σ \sqrt{2 π})}^{2 τ}} e^{- \frac{2 τ}{2} {(\frac{X - μ}{σ})}^{2}}] \\ = & \frac{1}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{3 / 2}} \frac{1}{σ^{2}} \end{matrix}$ $\begin{matrix} K_{τ}^{12} (θ) & = & K_{τ}^{21} (θ) = E [(\frac{X - μ}{σ^{2}}) ({(\frac{X - μ}{σ})}^{2} - \frac{1}{1 + τ}) \frac{1}{σ} \frac{1}{{(σ \sqrt{2 π})}^{2 τ}} e^{- \frac{2 τ}{2} {(\frac{X - μ}{σ})}^{2}}] \\ = & 0 \end{matrix}$ and $\begin{matrix} K_{τ}^{22} (θ) & = & E [{({(\frac{X - μ}{σ})}^{2} - \frac{1}{1 + τ})}^{2} \frac{1}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{2 τ}} e^{- \frac{2 τ}{2} {(\frac{X - μ}{σ})}^{2}}] \\ = & \frac{1}{σ^{2}} \frac{3 τ^{2} + 2 + 4 τ}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{5 / 2} {(1 + τ)}^{2}} \end{matrix}$ and $\begin{matrix} K_{τ} (θ) & = & (\begin{matrix} \frac{1}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{3 / 2}} \frac{1}{σ^{2}} & 0 \\ 0 & \frac{1}{σ^{2}} \frac{3 τ^{2} + 2 + 4 τ}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{5 / 2} {(1 + τ)}^{2}} \end{matrix}) \\ = & \frac{1}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{3 / 2}} (\begin{matrix} 1 & 0 \\ 0 & \frac{3 τ^{2} + 2 + 4 τ}{{(1 + τ)}^{2} (1 + 2 τ)} \end{matrix}) . \end{matrix}$
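The entries of $K_{τ} (θ)$ can be cross-checked by numerical integration. The sketch below is illustrative only: the names `k_tau_numeric` and `k_tau_closed_form` are hypothetical, the expectation is taken under the normal model itself, and the integration range is truncated at 12 standard deviations.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of a normal law with mean mu and standard deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def simpson(fn, lo, hi, n=4000):
    """Composite Simpson rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = fn(lo) + fn(hi)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * fn(lo + i * h)
    return total * h / 3.0


def psi(x, mu, sigma, tau):
    """The two components of Psi_tau(x; theta) for the normal model."""
    z = (x - mu) / sigma
    weight = normal_pdf(x, mu, sigma) ** tau  # f_theta(x)^tau
    return (z / sigma * weight,
            (z * z - 1.0 / (1.0 + tau)) / sigma * weight)


def k_tau_numeric(mu, sigma, tau):
    """K_tau = E[Psi Psi^T], each entry integrated against the model density."""
    lo, hi = mu - 12.0 * sigma, mu + 12.0 * sigma

    def entry(i, j):
        return simpson(
            lambda x: psi(x, mu, sigma, tau)[i] * psi(x, mu, sigma, tau)[j]
            * normal_pdf(x, mu, sigma), lo, hi)

    return [[entry(0, 0), entry(0, 1)], [entry(1, 0), entry(1, 1)]]


def k_tau_closed_form(sigma, tau):
    """Closed-form K_tau as stated in the text (it does not depend on mu)."""
    c = 1.0 / (sigma ** 2 * (sigma * math.sqrt(2.0 * math.pi)) ** (2.0 * tau)
               * (1.0 + 2.0 * tau) ** 1.5)
    k22 = c * (3.0 * tau ** 2 + 4.0 * tau + 2.0) / (
        (1.0 + tau) ** 2 * (1.0 + 2.0 * tau))
    return [[c, 0.0], [0.0, k22]]
```

Comparing `k_tau_numeric(mu, sigma, tau)` with `k_tau_closed_form(sigma, tau)` entry by entry, for a few values of $τ > 0,$ is a quick way to confirm the factors ${(1 + 2 τ)}^{3 / 2}$ and $3 τ^{2} + 4 τ + 2 .$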

Now we obtain the matrix $S_{τ} (θ) .$ We have $ξ (θ) = c_{τ} (θ) κ (θ)$ with $κ (θ) = \int f_{θ} {(x)}^{τ + 1} d x = \frac{1}{σ^{τ} {(\sqrt{2 π})}^{τ} \sqrt{1 + τ}} .$

Then, $ξ (θ) = \frac{1}{σ^{τ} {(\sqrt{2 π})}^{τ} \sqrt{1 + τ}} {(0, - \frac{τ}{(τ + 1)} \frac{1}{σ})}^{T}$ and $\frac{1}{κ (θ)} ξ (θ) ξ {(θ)}^{T} = \frac{1}{σ^{τ + 2} {(\sqrt{2 π})}^{τ} \sqrt{1 + τ}} (\begin{matrix} 0 & 0 \\ 0 & \frac{τ^{2}}{{(τ + 1)}^{2}} \end{matrix}) .$

On the other hand $J_{τ} (θ) = E [(\begin{matrix} \frac{1}{σ^{4}} {(X - μ)}^{2} & \frac{1}{σ^{2}} (\frac{1}{σ} - \frac{1}{σ^{3}} {(X - μ)}^{2}) (X - μ) \\ \frac{1}{σ^{2}} (\frac{1}{σ} - \frac{1}{σ^{3}} {(X - μ)}^{2}) (X - μ) & {(\frac{1}{σ} - \frac{1}{σ^{3}} {(X - μ)}^{2})}^{2} \end{matrix}) \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}}]$ $\begin{matrix} J_{τ}^{11} (θ) & = & E [\frac{1}{σ^{4}} {(μ - X)}^{2} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}}] = \frac{1}{σ^{τ + 2}} \frac{1}{{(τ + 1)}^{3 / 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \\ J_{τ}^{12} (θ) & = & J_{τ}^{21} (θ) = 0 \\ J_{τ}^{22} (θ) & = & E [{(\frac{1}{σ} - \frac{1}{σ^{3}} {(μ - X)}^{2})}^{2} \frac{1}{{(σ \sqrt{2 π})}^{τ}} e^{- \frac{τ}{2} {(\frac{X - μ}{σ})}^{2}}] = \frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} \frac{2 + τ^{2}}{{(1 + τ)}^{2}} \end{matrix}$

Therefore $J_{τ} (θ) = \frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} (\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2 + τ^{2}}{{(1 + τ)}^{2}} \end{matrix})$ $\begin{matrix} S_{τ} (θ) & = J_{τ} (θ) - \frac{1}{κ (θ)} ξ (θ) ξ {(θ)}^{T} \\ = \frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} ((\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2 + τ^{2}}{{(1 + τ)}^{2}} \end{matrix}) - (\begin{matrix} 0 & 0 \\ 0 & \frac{τ^{2}}{{(τ + 1)}^{2}} \end{matrix})) \\ = \frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} (\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2}{{(τ + 1)}^{2}} \end{matrix}) \end{matrix}$
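The same kind of check applies to $S_{τ} (θ) = J_{τ} (θ) - ξ (θ) ξ {(θ)}^{T} / κ (θ) .$ The sketch below assumes $ξ (θ) = \int f_{θ} {(x)}^{τ + 1} u_{θ} (x) d x,$ which is consistent with the value of $ξ (θ)$ given above but is not restated in this appendix; the function names are hypothetical.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of a normal law with mean mu and standard deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def simpson(fn, lo, hi, n=4000):
    """Composite Simpson rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = fn(lo) + fn(hi)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * fn(lo + i * h)
    return total * h / 3.0


def s_tau_numeric(mu, sigma, tau):
    """S_tau = J_tau - xi xi^T / kappa, with every integral done numerically."""
    lo, hi = mu - 12.0 * sigma, mu + 12.0 * sigma

    def score(x):
        # u_theta(x): partial derivatives of log f with respect to mu and sigma
        z = (x - mu) / sigma
        return (z / sigma, (z * z - 1.0) / sigma)

    def weight(x):
        return normal_pdf(x, mu, sigma) ** (tau + 1.0)

    kappa = simpson(weight, lo, hi)
    xi = [simpson(lambda x, i=i: score(x)[i] * weight(x), lo, hi)
          for i in range(2)]
    j = [[simpson(lambda x, i=i, k=k: score(x)[i] * score(x)[k] * weight(x),
                  lo, hi) for k in range(2)] for i in range(2)]
    return [[j[i][k] - xi[i] * xi[k] / kappa for k in range(2)]
            for i in range(2)]


def s_tau_closed_form(sigma, tau):
    """Closed-form S_tau as obtained in the text."""
    c = 1.0 / (sigma ** (tau + 2.0) * (2.0 * math.pi) ** (tau / 2.0)
               * math.sqrt(1.0 + tau))
    return [[c / (1.0 + tau), 0.0], [0.0, 2.0 * c / (1.0 + tau) ** 2]]
```

At $τ = 0$ the closed form reduces to the diagonal matrix with entries $1 / σ^{2}$ and $2 / σ^{2},$ the Fisher information matrix of the normal model in $(μ, σ),$ which gives a useful sanity check.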

We now compute, in turn, the following quantities.

The matrix ${[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1},$ with $G (θ) = {(0, 1)}^{T} .$ Here $G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})$ is a scalar, so its inverse is simply its reciprocal: $\begin{matrix} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) & = & (\begin{matrix} 0 & 1 \end{matrix}) {(\frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} (\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2}{{(τ + 1)}^{2}} \end{matrix}))}^{- 1} (\begin{matrix} 0 \\ 1 \end{matrix}) \\ & = & \frac{1}{2} σ^{2} σ^{τ} {(τ + 1)}^{\frac{5}{2}} {(\sqrt{2} \sqrt{π})}^{τ} \end{matrix}$
The matrix $Q_{τ} (θ_{0}) = S_{τ}^{- 1} (θ_{0}) G (θ_{0}) {[G^{T} (θ_{0}) S_{τ}^{- 1} (θ_{0}) G (θ_{0})]}^{- 1}$ $\begin{matrix} Q_{τ} (θ_{0}) & = & {(\frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} (\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2}{{(τ + 1)}^{2}} \end{matrix}))}^{- 1} (\begin{matrix} 0 \\ 1 \end{matrix}) {(\frac{1}{2} σ^{2} σ^{τ} {(τ + 1)}^{\frac{5}{2}} {(\sqrt{2} \sqrt{π})}^{τ})}^{- 1} \\ = & (\begin{matrix} 0 \\ 1 \end{matrix}) \end{matrix}$
The matrix $B_{τ} (θ_{0}) = S_{τ} {(θ_{0})}^{- 1} G (θ_{0}) {[G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} G (θ_{0})]}^{- 1} G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1} = Q_{τ} (θ_{0}) G {(θ_{0})}^{T} S_{τ} {(θ_{0})}^{- 1}$ $\begin{matrix} B_{τ} (θ_{0}) & = & Q_{τ} (θ_{0}) G^{T} (θ_{0}) S_{τ}^{- 1} (θ_{0}) = (\begin{matrix} 0 \\ 1 \end{matrix}) (\begin{matrix} 0 & 1 \end{matrix}) {(\frac{1}{σ^{τ + 2}} \frac{1}{{(\sqrt{2 π})}^{τ}} \frac{1}{\sqrt{1 + τ}} (\begin{matrix} \frac{1}{1 + τ} & 0 \\ 0 & \frac{2}{{(τ + 1)}^{2}} \end{matrix}))}^{- 1} \\ = & (\begin{matrix} 0 & 0 \\ 0 & \frac{1}{2} σ^{2} σ^{τ} {(τ + 1)}^{\frac{5}{2}} {(\sqrt{2} \sqrt{π})}^{τ} \end{matrix}) \end{matrix}$
The matrix $M_{γ, τ} (θ_{0}) = \frac{S_{γ} (θ_{0})}{κ_{γ} (θ_{0})} B_{τ} (θ_{0}) K_{τ} (θ_{0}) B_{τ} (θ_{0})$ $\begin{matrix} M_{γ, τ} (θ_{0}) & = & \frac{σ^{γ} {(\sqrt{2 π})}^{γ} \sqrt{1 + γ}}{σ^{γ + 2}} \frac{1}{{(\sqrt{2 π})}^{γ}} \frac{1}{\sqrt{1 + γ}} (\begin{matrix} \frac{1}{1 + γ} & 0 \\ 0 & \frac{2}{{(γ + 1)}^{2}} \end{matrix}) \\ \times (\begin{matrix} 0 & 0 \\ 0 & \frac{1}{2} σ^{2} σ^{τ} {(τ + 1)}^{\frac{5}{2}} {(\sqrt{2} \sqrt{π})}^{τ} \end{matrix}) \\ \times \frac{1}{σ^{2}} \frac{1}{{(σ \sqrt{2 π})}^{2 τ} {(1 + 2 τ)}^{3 / 2}} (\begin{matrix} 1 & 0 \\ 0 & \frac{3 τ^{2} + 2 + 4 τ}{{(1 + τ)}^{2} (1 + 2 τ)} \end{matrix}) \\ \times (\begin{matrix} 0 & 0 \\ 0 & \frac{1}{2} σ^{2} σ^{τ} {(τ + 1)}^{\frac{5}{2}} {(\sqrt{2} \sqrt{π})}^{τ} \end{matrix}) \\ = & (\begin{matrix} 0 & 0 \\ 0 & \frac{1}{2} \frac{{(τ + 1)}^{3}}{{(γ + 1)}^{2} {(2 τ + 1)}^{\frac{5}{2}}} (3 τ^{2} + 4 τ + 2) \end{matrix}) . \end{matrix}$
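The chain of matrix products leading to $M_{γ, τ} (θ_{0})$ can be reproduced mechanically. The sketch below builds $S,$ $K$ and $B$ from the closed forms obtained above, multiplies them, and compares the only nonzero diagonal entry with the final expression. All function names are hypothetical, and only 2 × 2 matrices are handled.

```python
import math


def inv2(m):
    """Inverse of a 2 x 2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul2(a, b):
    """Product of two 2 x 2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def kappa(sigma, t):
    """kappa(theta): integral of f_theta^(t + 1) for the normal model."""
    return 1.0 / (sigma ** t * (2.0 * math.pi) ** (t / 2.0) * math.sqrt(1.0 + t))


def s_matrix(sigma, t):
    c = kappa(sigma, t) / sigma ** 2
    return [[c / (1.0 + t), 0.0], [0.0, 2.0 * c / (1.0 + t) ** 2]]


def k_matrix(sigma, t):
    c = 1.0 / (sigma ** 2 * (sigma * math.sqrt(2.0 * math.pi)) ** (2.0 * t)
               * (1.0 + 2.0 * t) ** 1.5)
    k22 = c * (3.0 * t ** 2 + 4.0 * t + 2.0) / ((1.0 + t) ** 2 * (1.0 + 2.0 * t))
    return [[c, 0.0], [0.0, k22]]


def b_matrix(sigma, t):
    """B = S^-1 G (G^T S^-1 G)^-1 G^T S^-1 with G = (0, 1)^T."""
    s_inv = inv2(s_matrix(sigma, t))
    g = [0.0, 1.0]
    s_inv_g = [sum(s_inv[i][k] * g[k] for k in range(2)) for i in range(2)]
    g_s_inv = [sum(g[k] * s_inv[k][j] for k in range(2)) for j in range(2)]
    scalar = sum(g[i] * s_inv_g[i] for i in range(2))
    return [[s_inv_g[i] * g_s_inv[j] / scalar for j in range(2)]
            for i in range(2)]


def m_matrix(sigma, gamma, tau):
    """M = (S_gamma / kappa_gamma) B_tau K_tau B_tau."""
    a = [[e / kappa(sigma, gamma) for e in row] for row in s_matrix(sigma, gamma)]
    b = b_matrix(sigma, tau)
    return matmul2(matmul2(matmul2(a, b), k_matrix(sigma, tau)), b)


def eigenvalue_closed_form(gamma, tau):
    """Nonzero eigenvalue of M as given in the final display above."""
    return (0.5 * (tau + 1.0) ** 3 * (3.0 * tau ** 2 + 4.0 * tau + 2.0)
            / ((gamma + 1.0) ** 2 * (2.0 * tau + 1.0) ** 2.5))
```

Two properties are worth checking with this sketch: the entry `m_matrix(sigma, gamma, tau)[1][1]` should not depend on `sigma`, and at $γ = τ = 0$ the closed form evaluates to 1.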

References

1. Beran, R. Minimum Hellinger distance estimates for parametric models. Ann. Stat.; 1977; 5, pp. 445-463. [DOI: https://dx.doi.org/10.1214/aos/1176343842]

2. Tamura, R.N.; Boos, D.D. Minimum Hellinger distance estimation for multivariate location and covariance. J. Am. Stat. Assoc.; 1986; 81, pp. 223-229. [DOI: https://dx.doi.org/10.1080/01621459.1986.10478264]

3. Simpson, D.G. Minimum Hellinger distance estimation for the analysis of count data. J. Am. Stat. Assoc.; 1987; 82, pp. 802-807. [DOI: https://dx.doi.org/10.1080/01621459.1987.10478501]

4. Simpson, D.G. Hellinger deviance tests: Efficiency, breakdown points, and examples. J. Am. Stat. Assoc.; 1989; 84, pp. 107-113. [DOI: https://dx.doi.org/10.1080/01621459.1989.10478744]

5. Lindsay, B.G. Efficiency versus robustness: The case for minimum Hellinger distance and related methods. Ann. Stat.; 1994; 22, pp. 1081-1114. [DOI: https://dx.doi.org/10.1214/aos/1176325512]

6. Pardo, L. Statistical Inference Based on Divergence Measures; Chapman & Hall/CRC: Boca Raton, FL, USA, 2006.

7. Basu, A.; Shioya, H.; Park, C. Statistical Inference: The Minimum Distance Approach; Chapman & Hall/CRC Press: Boca Raton, FL, USA, 2011.

8. Broniatowski, M.; Toma, A.; Vajda, I. Decomposable pseudodistances and applications in statistical estimation. J. Stat. Plan. Inference; 2012; 142, pp. 2574-2585. [DOI: https://dx.doi.org/10.1016/j.jspi.2012.03.019]

9. Castilla, E.; Jaenada, M.; Pardo, L. Estimation and testing on independent not identically distributed observations based on Rényi’s pseudodistances. IEEE Trans. Inf. Theory; 2022; in press. [DOI: https://dx.doi.org/10.1109/TIT.2022.3158308]

10. Silvey, S.D. Reprinting, Monographs on Statistical Subjects; Chapman and Hall: London, UK, 1975.

11. Basu, A.; Mandal, A.; Martin, N.; Pardo, L. Testing Composite Hypothesis Based on the Density Power Divergence. Sankhya B Indian J. Stat.; 2018; 80, pp. 222-262. [DOI: https://dx.doi.org/10.1007/s13571-017-0143-0]

12. Ghosh, A. Influence function analysis of the restricted minimum divergence estimators: A general form. Electron. J. Stat.; 2015; 9, pp. 1017-1040. [DOI: https://dx.doi.org/10.1214/15-EJS1025]

13. Jones, M.C.; Hjort, N.L.; Harris, I.R.; Basu, A. A comparison of related density-based minimum divergence estimators. Biometrika; 2001; 88, pp. 865-873. [DOI: https://dx.doi.org/10.1093/biomet/88.3.865]

14. Fujisawa, H.; Eguchi, S. Robust parameter estimation with a small bias against heavy contamination. J. Multivar. Anal.; 2008; 99, pp. 2053-2081. [DOI: https://dx.doi.org/10.1016/j.jmva.2008.02.004]

15. Castilla, E.; Jaenada, M.; Martin, N.; Pardo, L. Robust approach for comparing two dependent normal populations through Wald-type tests based on Rényi’s pseudodistance estimators. arXiv; 2022; arXiv: 2202.00982

16. Toma, A.; Leoni-Aubin, S. Robust tests based on dual divergence estimators and saddlepoint approximations. J. Multivar. Anal.; 2010; 101, pp. 1143-1155. [DOI: https://dx.doi.org/10.1016/j.jmva.2009.11.001]

17. Toma, A.; Karagrigoriou, A.; Trentou, P. Robust model selection criteria based on pseudodistances. Entropy; 2020; 22, 304. [DOI: https://dx.doi.org/10.3390/e22030304] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33286078]

18. Castilla, E.; Martin, N.; Muñoz, S.; Pardo, L. Robust Wald-type tests based on Minimum Rényi Pseudodistance Estimators for the Multiple Regression Model. J. Stat. Comput. Simul.; 2020; 14, pp. 2592-2613. [DOI: https://dx.doi.org/10.1080/00949655.2020.1787410]

19. Castilla, E.; Ghosh, A.; Jaenada, M.; Pardo, L. On regularization methods based on Rényi’s pseudodistances for sparse high-dimensional linear regression models. arXiv; 2022; arXiv: 2202.00982

20. Jaenada, M.; Pardo, L. The minimum Renyi’s Pseudodistances estimators for Generalized Linear Models. Data Analysis and Related Applications: Theory and Practice; Proceeding of the ASMDA Wiley: Athens, Greece, 2021.

21. Jaenada, M.; Pardo, L. Robust Statistical Inference in Generalized Linear Models Based on Minimum Renyi’s Pseudodistance Estimators. Entropy; 2022; 24, 123. [DOI: https://dx.doi.org/10.3390/e24010123]

22. Hampel, F.R.; Ronchetti, E.; Rousseeuw, P.J.; Stahel, W. Robust Statistics: The Approach Based on Influence Functions; John Wiley & Sons: Hoboken, NJ, USA, 1986.

23. Rao, C.R. Score test: Historical review and recent developments. Advances in Ranking and Selection, Multiple Comparisons, and Reliability; Birkhäuser: Boston, MA, USA, 2005; pp. 3-20.

24. Basu, A.; Ghosh, A.; Martin, N.; Pardo, L. A Robust Generalization of the Rao Test. J. Bus. Econ. Stat.; 2021; 40, pp. 868-879. [DOI: https://dx.doi.org/10.1080/07350015.2021.1876711]

25. Warwick, J.; Jones, M.C. Choosing a robustness tuning parameter. J. Stat. Comput. Simul.; 2005; 75, pp. 581-588. [DOI: https://dx.doi.org/10.1080/00949650412331299120]

26. Basak, S.; Basu, A.; Jones, M.C. On the optimal density power divergence tuning parameter. J. Appl. Stat.; 2021; 48, pp. 536-556. [DOI: https://dx.doi.org/10.1080/02664763.2020.1736524]

27. Welch, W.J. Rerandomizing the median in matched-pairs designs. Biometrika; 1987; 74, pp. 609-614. [DOI: https://dx.doi.org/10.1093/biomet/74.3.609]

28. Basu, A.; Mandal, A.; Martin, N.; Pardo, L. Testing statistical hypotheses based on the density power divergence. Ann. Inst. Stat. Math.; 2013; 65, pp. 319-348. [DOI: https://dx.doi.org/10.1007/s10463-012-0372-y]

29. Darwin, C. The Effects of Cross and Self Fertilisation in the Vegetable Kingdom; AMS Press Inc.: New York, NY, USA, 1877.

30. Dik, J.J.; de Gunst, M.C.M. The Distribution of General Quadratic Forms in Norma. Stat. Neerl.; 1985; 39, pp. 14-26. [DOI: https://dx.doi.org/10.1111/j.1467-9574.1985.tb01121.x]

31. Harville, D.A. Matrix Algebra from a Statistician’s Perspective; Springer: New York, NY, USA, 2008.


© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Rao’s score, Wald and likelihood ratio tests are the most common procedures for testing hypotheses in parametric models. None of the three test statistics is uniformly superior to the other two in terms of power; moreover, they are first-order equivalent and asymptotically optimal. However, these three classical tests present serious robustness problems, as they are based on the maximum likelihood estimator, which is highly non-robust. To overcome this drawback, some test statistics based on robust estimators have been introduced in the literature, such as robust generalized Wald-type and Rao-type tests based on minimum divergence estimators. In this paper, restricted minimum Rényi’s pseudodistance estimators are defined, and their asymptotic distribution and influence function are derived. Further, robust Rao-type and divergence-based tests based on minimum Rényi’s pseudodistance and restricted minimum Rényi’s pseudodistance estimators are considered, and the asymptotic properties of the new families of test statistics are obtained. Finally, the robustness of the proposed estimators and test statistics is empirically examined through a simulation study, and illustrative applications to real-life data are analyzed.

Details

Title: Robust Test Statistics Based on Restricted Minimum Rényi’s Pseudodistance Estimators

Author: Jaenada, María; Miranda, Pedro; Pardo, Leandro

First page: 616

Publication year: 2022

Publication date: 2022

Publisher: MDPI AG

e-ISSN: 10994300

Source type: Scholarly Journal

Language of publication: English

DOI: https://doi.org/10.3390/e24050616

ProQuest document ID: 2670145815
