1. Introduction
Let be the statistical space associated with the random variable where is the -field of Borel subsets and is a family of probability distributions defined on the measurable space whit an open subset of and We assume that the probability measures are described by densities where is a -finite measure on Given a random sample of the random variable X with density belonging to the parametric family , the most popular estimator for the model parameter is the maximum likelihood estimator (MLE), which maximizes the likelihood function of the assumed model. The MLE has been widely studied in the literature for general statistical models, and it has been shown that, under certain regularity conditions, the sequence of MLEs of is asymptotically normal and it satisfies some desirable properties, such as consistency and asymptotic efficiency. That is, the MLE is the BAN (best asymptotically normal) estimator. However, in many popular statistical models, the MLE is markedly non-robust against deviations, even very small ones, from the parametric conditions.
To overcome the lack of robustness, minimum distance (or minimum divergence) estimators (MDEs) have been developed. MDEs have received growing attention in statistical inference because of their ability to conciliate efficiency and robustness. In parametric estimation, the role of divergence or distance measures is very intuitive: the estimates of the unknown parameters are obtained by minimizing a suitable divergence measure between the estimated from data and the assumed model distributions. There is a growing body of literature that recognizes the importance of MDEs in terms of robustness, without a significant loss of efficiency, with respect to the MLE. See, for instance, the works of Beran [1], Tamura and Boes [2], Simpson [3,4], Lindsay [5], Pardo [6], and Basu et al. [7] and the references therein.
Let G denote the unknown distribution function, with associated density underlying the data. The minimum divergence (distance) functional evaluated at G, , is defined as
(1)
with being a distance or divergence measure between the densities g and As the true distribution underlying the data is unknown, given a random sample, we could estimate the model parameter , substituting in the previous expression the true distribution G by its empirical estimation Therefore, the MDE of is given by(2)
When dealing with continuous models, it is convenient to consider families of divergence measures for which non-parametric estimators of the unknown density function are not needed. From this perspective, the density power divergence (DPD) family, leading to the minimum density power divergence estimators (MDPDEs) (see Basu et al. [7]), as well as the Rényi’s pseudodistance (RP), leading to the minimum Rényi’s pseudodistance estimators (MRPE) (see Broniatowski et al. [8]) between others, play an important role. The results presented in Broniatowski et al. [8] in the context of independent and identically distributed random variables were extended for the case of independent but not identically distributed random variables by Castilla et al. [9].
In many situations we have additional knowledge about the true parameter value, as it must satisfy certain constraints. Then, the restricted parameter space has the form
(3)
where denotes the null vector of dimension r, and is a vector-valued function such that the matrix(4)
exists and is continuous in and rank . Here, superscript T represents the transpose of the matrix. In the following, the restricted parameter space given in (3) is denoted by as in most situations, it will represent a composite null hypothesis.The most popular estimator of under the non-linear constraint given in (3) is the restricted MLE (RMLE) that maximizes the likelihood function subject to the constraint (see Silvey [10]). The RMLE encounters similar robustness problems to the MLE. To overcome such deficiency, the restricted MDPDEs (RMDPDEs) were introduced in Basu et al. [11] and their theoretical robustness properties were later studied in Ghosh [12].
The main purpose in this paper is extending the theory developed for the MRPE to the restricted parameter space setting, yielding to the restricted MRPE (RMPRE), where the parameter space has the form (3). The rest of the paper is as follows: In Section 2, MRPE is introduced. Section 3 presents RMPRE, and its asymptotic distribution as well as its influence function are obtained. In Section 4, two different test statistics for testing composite null hypothesis, based on the RMRPE, are developed, and explicit expressions of the statistics are presented for testing in normal populations. Section 5 presents a simulation study, where the robustness of the proposed estimators and test statistics is empirically shown. Section 6 deals with real-data situations. Finally, some conclusions are presented in Section 7.
2. Minimum Rényi Pseudodistance Estimators
In this section, we introduce the MRPE. We derive the estimating equations of the MRPE and recall its asymptotic distribution.
Let be a random sample of size n from a population having true and unknown density function modeled by a parametric family of densities with The RP between the densities and g is given, for by
The RP can be defined for taking continuous limits, yielding the expression
Then, the RP coincides with the Kullback–Leibler divergence (KL) between g and , at (see Pardo, 2006).
The RP was considered for the first time by Jones et al. [13]. Later Broniatowski et al. [8] established some useful properties of the divergence, such as the positivity of the RP for any two densities and for all values of the parameter and uniqueness of the minimum RP within a parametric family, that is, if and only if The last property justifies the definition of the MRPEs as the minimizer of the RP between the assumed distribution and the empirical distribution of the data. It is interesting to note that the so-called RP by Broniatowski et al. [8] had been previously considered by Fujisawa and Eguchi [14] under the name of -cross entropy. In that paper, some appealing robustness properties of the estimators based on such entropy are shown.
Given a sample , from Broniatowski et al. [8] it can be seen that minimizing leads to the following definition.
Let be a statistical space. The MRPE based on the random sample for the unknown parameter θ is given, for , by
(5)
where
Further, at minimizes the KL divergence, and thus the MRPE coincides with the MLE for Based on the previous definition (5), differentiating, we obtain that the estimating equations of the MRPE are given by
(6)
with(7)
being(8)
(9)
(10)
The MRPE is an M-estimator and thus its asymptotic distribution and influence function (IF) can be obtained based on the asymptotic theory of the M-estimators. Broniatowski et al. [8] studied the asymptotic properties and robustness of the MRPEs. The next result recalls the asymptotic distribution of the MRPEs.
Let be the true unknown value of Then,
(11)
where(12)
with(13)
(14)
Castilla et al. [15] introduced useful notation for the computation of
(15)
(16)
where(17)
and and are as in (9) and (10), respectively.Toma and Leoni-Aubin [16] defined new robust and efficient measures based on the RP. Later, Toma et al. [17] considered the MRPE for general parametric models and developed a model selection criterion for regression models. Broniatowski et al. [8] applied the method to the multiple regression model (MRM) with random covariates. Subsequently, Castilla et al. [18] developed Wald-type tests based on MRPE for the MRM, and Castilla et al. [19] studied the MRPE for the MRM in the ultra-high dimensional set-up. Further, Jaenada and Pardo [20,21] considered the MRPE and Wald-type test statistics for generalized linear models (GLM). Despite Wald-type test statistics, there exist others relevant test statistics having an important role in the statistical literature: the likelihood-ratio and Rao (or score) tests, which are based on restricted estimators, usually the RMLE. Then, it makes sense to develop robust versions of these popular statistics based on the RMRPE.
3. The Restricted Minimum Rényi Pseudodistance Estimator: Asymptotic Distribution and Influence Function of RMRPE
In this section, we introduce the RMRPE and we derive its asymptotic distribution. Moreover, we study its robustness properties through its influence function (IF).
The RMRPE functional evaluated at the distribution G is defined by
given that such a minimum exists.
Accordingly, given random sample from the distribution G, the RMRPE of θ is defined as
Next, the result states the asymptotic distribution of the RMRPE,
Suppose that the true distribution satisfies the conditions of the model and let us denote by the true parameter. Then, the RMRPE of θ obtained under the constraints has distribution
where
(18)
(19)
and is defined in (13), evaluated at .See Appendix A. □
To analyze the robustness of an estimator, Hampel et al. [22] introduced the concept of the influence function (IF). Since then, the IF has been widely used in statistical literature to measure robustness in different statistical contexts. Intuitively, the IF describes the effect of an infinitesimal contamination of the model on the estimate. Then, IFs associated to locally robust (B-robust) estimators should be bounded. Let us now obtain the IF of RMRPE and analyze its boundedness to asses the robustness of the proposed estimators. We consider the contaminated model with the indicator function in and we denote being the distribution function associated to By definition, is the minimizer of subject to Following the same steps as in Theorem 5 in Broniatowski et al. [8], it can be seen that the influence function of in is given by
(20)
where was defined in (8) and with the additional condition that Note that expression (20) corresponds to the IF of the unrestricted MRPE. Differentiating this last equation gives, at(21)
Based on (20) and (21) we have
Therefore,
and(22)
Note that matrices and involved in the expression (22) are defined except for the model and tuning parameters and , and so the boundedness of the IF of the RMRPE depends, therefore, on the boundedness of the factor
Therefore, the boundedness of the IF of the RMRPE depends directly on the boundedness of IF of the MRPE, stated in (20). The IF of the MRPE has been widely studied for general statistical models, concluding that the MRPEs are robust for positive values of and that such robustness increases with the tuning parameter. A whole discussion can be found in the work of Broniatowski et al. [8]. Hence, the same properties hold for RMRPEs.
4. Robust Test Statistics Based on RMRPEs
In this section, we develop two statistics based on the RMRPEs for testing composite null hypothesis, and their asymptotic distributions are obtained. Both procedures are particularized to standard deviation testing (with unknown mean) under normal populations, and explicit expressions of the test statistics are obtained.
4.1. Testing Based on Divergence Measures
In this section, we present the family of Rényi’s pseudodistance test statistics (RPTS) for testing the null hypothesis given in (3). This family of test statistics is given by
(23)
The RPTS, , can be understood as a measure between the best unrestricted estimator of the model parameter, and the best estimator satisfying the null hypothesis. Large values of the RPTS indicate that the model densities associated with the restricted and unrestricted estimators are far away one from the other, and so the null hypothesis is not supported by the observed data. Hence, we should reject for large enough . We can observe that the family of RPTS defined in (23) depends on two tuning parameters, and . The first is used for estimating the unknown parameters, while the second is applied to obtain the family of test statistics. The following theorem presents the asymptotic distribution of the family of RPTS defined in (23).
The asymptotic distribution of defined in (23) coincides, under the null hypothesis given in (3), with the distribution of the random variable
where are independent standard normal variables, are the nonzero eigenvalues of and The matrices and are given by,
(24)
(25)
See Appendix A. □
Rényi’s Pseudodistance Test Statistics for Normal Populations
Under the model, consider the problem of testing
(26)
where is an unknown nuisance parameter. In this case, the unrestricted and null parameter spaces are given by and , respectively. If we consider the function with , the null hypothesis can be written as and we are in the situation considered in (26). We can observe that in our case Based on (6) and taking into account the fact that is the normal density with mean and variance , the MRPE of is the solution of the system of nonlinear equations while the RMRPE when is the solution of the nonlinear equationAfter some algebra (see the Appendix A) we obtain that the RPTS for testing (26) under normal populations can be expressed as
(27)
Based in (27), and taking into account that the eigenvalue of the matrix is given by (see Appendix A) we apply Theorem 3 such thatNote that the RPTS is indexed by two tuning parameters, and , the first controlling the robustness of the pseudodistance and the second controlling the robustness on the estimation. For simplicity, we use for the normal population application.
For , the RPTS coincides with the asymptotic likelihood ratio test for testing (26). Indeed, for we have that the MLE and RMLE are given, respectively, by
Now, the expression of the Kullback–Leibler divergence (the RP for ) between two normal densities, and is given by
(28)
and thus the RPTS for is
On the other hand, the likelihood ratio for testing (26) is given by
and so, both expressions are related through
4.2. Rao’s-Type Tests Based on RMRPE
Rao test statistics are one of the most popular score test statistics for testing a simple and composite null hypothesis in general statistical models. For the simple null hypothesis testing, it requires no parameter estimation, but for composite ones, the classical Rao test is based on the likelihood score function associated with the restricted MLE (see Rao [23]). Basu et al. [24] generalized Rao’s procedure by using score functions associated with RMDPDEs, bringing in a considerable gain of robustness of the Rao-type test obtained. In this section, we develop Rao-type test statistics based on the score function associated to RMRPEs.
Let us consider the -score function associated to the RMRPE,
so the estimating equations for the MRPE are given byThen, the -score statistic can be defined as
However, taking expectations in the corresponding quantities, it is not difficult to show that
where is defined in (16), and so, by the central limit theorem, the -score statistic is asymptotically normal,(29)
The previous convergence motivates the definition of the Rao-type test statistics.
4.2.1. Rao-Type Test Statistics for Testing Simple Null Hypothesis
We first consider the simple null hypothesis test
(30)
Then, the Rao-type test statistics for testing (30) is defined as
Note that here the last test statistics depend on through the matrices and involved in the definition, and again, the robustness of the statistics increases with Moreover, the last matrix may have an explicit expression for certain statistical models, but otherwise it would have to be estimated from the sample.
Further, from (29), we have that, under the null hypothesis,
with p being the dimension of the parameter space. Then, the null hypothesis is rejected if , where denotes the upper -quantile of a chi-square distribution with p degrees of freedom.4.2.2. Rao-Type Test Statistics for Testing Composite Null Hypothesis
Next, let us consider composite null hypothesis of the form
(31)
where the function is a differentiable vector-valued function. Then, any vector satisfying the null hypothesis belongs to a restricted parameter space given in (3). The generalized Rao-type test statistic associated to the RMRPE with tuning parameter , for testing (31) is given by(32)
Using similar arguments to Basu et al. [24], it is possible to show that, under general regularity conditions, the Rao-type test statistics have an asymptotic chi-square distribution with r degrees of freedom under the null hypothesis given in (31). Therefore, the rejection region of the test is given by
Again, the tuning parameter controls the trade-off between efficiency and robustness of the test. Indeed, for , the generalized Rao type test statistic coincides with the classical Rao test for composite null hypothesis.
4.2.3. Rao Test for Normal Populations
Consider the test defined in (26) for testing the standard deviation value of a normal population with unknown mean. The explicit expression of the main matrices involved in the definition (32) for such testing procedure and assumed parametric model is given by
The step-by-step calculation of such values are detailed in the Appendix A. Then, the Rao-type test for composite null hypothesis of the form (31) is given by
where denotes the RMRPE with tuning parameter Note that, for , Then, the Rao-type test statistic based on RMRPE with (the restricted MLE) coincides with the classical Rao test.5. Simulation Study: Application to Normal Populations
In this section, we empirically analyze the performance of the proposed estimators under the normal parametric model and RPTS and Rao-type test statistics for the problem of testing (26) in terms of efficiency and robustness. We examine the accuracy of the RMRPEs, and we further examine the robustness properties of both families of estimators under different contamination scenarios. Further, we investigate the empirical level and power of the proposed test statistics under different sample sizes and contamination scenarios.
Let us consider a univariate normal model with true parameter value and the problem of testing
(33)
The restricted parameter space is then given by
In order to evaluate the robustness properties of the estimators and test statistics, we introduce contamination in data by replacing a of the observations by a contaminated sample, where denotes the contamination level. We generate five different scenarios of contamination:
Pure data.
Scenario 1: Slightly contaminated data. We replace a of the samples by a contaminated sample from a normal distribution,
Scenario 2: Heavily contaminated data. We replace a of the samples by a contaminated sample from a normal distribution,
Further, in order to evaluate the power of the test, we consider an alternative true parameter value which does not satisfy the null hypothesis (33) (or equivalently the restrictions of the parameter space). In this scenario, contaminated parameters are set for slightly and for heavily contamination.
Figure 1 shows the root mean square error (RMSE) of the RMRPE of the scale parameter , for different values of the tuning parameter and over replications. As expected, large values of the tuning parameter produce more robust estimators, which is particularly advantageous for the heavily contaminated scenario. Furthermore, even when introducing very low levels of contamination in data, the RMRPE with moderate value of the tuning parameter outperforms the classical MLE, without a significant loss of efficiency in the absence of contamination.
On the other hand, Figure 2 presents the empirical level and power of both RPTS and Rao-type test statistics based on RMRPEs for different values of the tuning parameter, under increasing contamination levels. The empirical level and power are computed as the mean number of rejections over replications. The empirical level produced by the classical ratio and Rao-type tests rapidly increases and separates from levels obtained with any robust test. Regarding the empirical power, all robust tests with moderate and large values of the tuning parameter outperform the classical estimator within their family under contaminated scenarios, but Rao-type test statistics based on RMRPEs are more conservative than RPTSs, thus exhibiting lower levels and powers. Then, the proposed test statistics provides an appealing alternative to classical likelihood ratio and Rao tests, with a small loss of efficiency in favor of a clear gain in terms of robustness.
On the other hand, the sample size could play a crucial role in the performance of the tests, even more accentuated when there exists data contamination. Figure 3 shows the sample size effect on the performance of the tests in terms of empirical level, under a of contamination level in data. As discussed, Rao-type test statistics based on RMRPEs is more conservative and so tests based on RMRPEs with positive values of the tuning parameter produce lower empirical levels. Here, it outperforms the poor performance of the classical Rao-type test statistics with respect to any other. Moreover, when the sample size increases, the performance gap between non-robust and robust methods is widening.
Following the discussions in the preceding sections, larger values of the tuning parameter produce more robust but less efficient estimators. Therefore, the optimal value of should obtain the best trade-off between efficiency and robustness. Warwick and Jones [25] first introduced a useful data-based procedure for the choice of the tuning parameter for the MDPDE based on minimizing the asymptotic MSE of the estimator. However, this method depends on the choice of a pilot estimator, and Basak et al. [26] improved the method by removing the dependency on an initial estimator. The proposed algorithm was developed ad hoc for the MDPDE, but it can be easily adapted to the MRPE and RMRPE by simply substituting the expression of the variance of the MDPDE by the variance of the MRPPE or the RMRPE, respectively.
6. Real Data Application
Finally, we illustrate the outperformance of the proposed test statistics in two real data applications, where the gathered information contains some outlying observations. Both real dataset are modeled under the normal model, and hypothesis tests on the standard deviation of the population are performed.
6.1. Telephone-Fault Data
We consider the data on telephone line faults presented and analyzed by Welch [27] and Simpson [4]. The dataset consist of ordered differences between the inverse test rates and the inverse control rates in matched pairs of areas,
Basu et al. [24,28] modeled these differences as a normal random variable and pointed out that the first observation is a clear outlier, as its value is distant from the rest of the data. They tested simple and composite null hypotheses for the mean under the normal model, as well as a simple null hypothesis assuming a known mean. Here, we propose to test for the standard deviation of the normal distribution. Note that, computing the MLE of the sample with full and clean data (after removing the outlying observation), we obtain and respectively. Accordingly, the outlier clearly influences the model parameter estimates, playing a crucial role on the rejection of any null hypothesis. We consider the composite null hypothesis(34)
where the value has been chosen according to the estimation with clean data.Figure 4 presents the RPTS (top) and Rao (bottom) test statistics (left) and p-values (right) for the telephone data against increasing tuning parameters. While it is clearly seen that both classical tests fail to not reject the null hypothesis when fitting the model with the original data, the decision turns around sharply as the tuning parameter crosses and goes beyond for the RPTS and for Rao-type test statistics based on MRPEs. On the other hand, the decision of not rejecting is agreed by all statistics when fitting the model with clean data. This example illustrates the great applicability of the robust methods, which are not too affected by a such outlying observation, and the good performance of the proposed statistics under contaminated observations, which stay stable.
6.2. Darwin’s Plant Fertilization Data
Darwin [29] performed an experiment to determine whether self-fertilized plants and cross-fertilized plants have different growth rates. He sowed in pots pairs of Zea mays plants, one self-fertilized and the other cross-fertilized, and after a specific time period, the height of each plant was measured. A particular sample of pairs of plants led to the following paired differences (cross-fertilized minus self-fertilized).
A parametric approach to analyze the data as a random sample from a normal distribution with unknown mean and standard deviation was developed by Basu et al. [24]. Here, there is not any huge outlying observation, but the first two observations seem to be distant from the rest of the sample, influencing the model parameter estimates and test decisions. Indeed, the MLE, computing with original data, is while the MLE, when removing the two first observations, switches to Therefore, removing influential observations may alter the decision of a test. According to these results, we consider the testing problem
(35)
Figure 5 shows the test statistics (left) and corresponding p-values (right) for the two families of statistics considered, the RPTS (top) and Rao-type test statistics (bottom) against the tuning parameter value . Again, test statistics based on RMRPE with large enough tuning parameters do not reject the null hypothesis, unlike tests based on low values of , including the RMLE. The disagreement departs when using the clean data, as all tests agree on not rejecting the null hypothesis.
7. Concluding Remarks
In this paper, we presented for the first time the family of RMRPEs. We derived their asymptotic distribution, and proved some suitable properties as consistency under the parameter restriction and robustness against data contamination. Further, based on these RMRPEs, we generalized two important families of statistics, namely RPTS and Rao-type tests, for testing a composite null hypothesis. Moreover, we obtained some explicit expressions of the RMPREs, RPTS and Rao-type test statistics for testing the variance under a normal population with an unknown mean. It was empirically shown that the proposed RPTS and Rao-type test statistics are robust, unlike classical tests based on the MLE, under normal populations. Indeed, the robustness of the tests is controlled by a tuning parameter , and so larger values of produce more robust estimators (although less efficient). Finally, some classical numerical examples illustrate the theoretical properties and applicability of the proposed methods.
Conceptualization, M.J., P.M. and L.P.; methodology, M.J., P.M. and L.P.; software, M.J., P.M. and L.P.; validation, M.J., P.M. and L.P.; formal analysis, M.J., P.M. and L.P.; investigation, M.J., P.M. and L.P.; resources, M.J., P.M. and L.P.; data curation, M.J., P.M. and L.P.; writing—original draft preparation, M.J., P.M. and L.P.; writing—review and editing, M.J., P.M. and L.P.; visualization, M.J., P.M. and L.P.; supervision, M.J., P.M. and L.P.; project administration, M.J., P.M. and L.P.; funding acquisition, M.J., P.M. and L.P. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
Not applicable.
We are very grateful to the referees and associate editor for their helpful comments and suggestions.
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
The following abbreviations are used in this manuscript:
DPD | Density Power Divergence |
IF | Influence Function |
KL | Kullback–Leibler |
LRM | Linear Regression Model |
MLE | Maximum Likelihood Estimator |
MDPDE | Minimum Density Power Divergence Estimator |
MRPE | Minimum Rényi Pseudodistance Estimator |
RMRPE | Restricted minimum Rényi Pseudodistance Estimator |
RP | Rényi Pseudodistance |
RPTS | Rényi Pseudodistance test statistic |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. RMSE of the RMRPE under increasing contamination levels (slightly contaminated at left and heavily contaminated at right) for different values of the tuning parameter [Forumla omitted. See PDF.] over [Forumla omitted. See PDF.] replications. (a) Scenario 1, (b) Scenario 2.
Figure 2. Empirical level and power under increasing contamination (slightly contaminated at left and heavily contaminated at right) over [Forumla omitted. See PDF.] repetitions. (a) Scenario 1, (b) Scenario 2.
Figure 2. Empirical level and power under increasing contamination (slightly contaminated at left and heavily contaminated at right) over [Forumla omitted. See PDF.] repetitions. (a) Scenario 1, (b) Scenario 2.
Figure 3. Empirical level under increasing sample sizes for [Forumla omitted. See PDF.] of contamination level (slightly contaminated at left and heavily contaminated at right) over [Forumla omitted. See PDF.] repetitions. (a) Scenario 1, (b) Scenario 2.
Figure 4. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-valuess (right), for testing (34) with original and cleaned (after outliers removal) telephone-fault data.
Figure 4. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-valuess (right), for testing (34) with original and cleaned (after outliers removal) telephone-fault data.
Figure 5. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-values (right), for testing (35) with original and cleaned (after outliers removal) Darwing data.
Figure 5. RPTS (top) and Rao-type test statistics (bottom), jointly with their associated p-values (right), for testing (35) with original and cleaned (after outliers removal) Darwing data.
Appendix A
Appendix A.1. Proof of Theorem 2
We denote
Differentiating both sides of the equality, we have
Now we establish that
We have
As
From the above, after some algebra, we obtain
On the other hand, it not difficult to establish that
Therefore we have
Finally,
On the other hand,
Then, the RMRPE estimator of
However,
Since
Now, we know that
Further, the RMRPE
From (
Now we can express Equations (
Therefore
However,
Now by (
Appendix A.2. Proof of Theorem 3
Consider the expression
It is clear that
Then,
Therefore,
Regarding the second derivatives, we have
Therefore,
Under
Based on
Therefore,
On the other hand, we know that
From Equations (
Therefore, it follows that
Now, observe from the definition that
Then, the asymptotic distribution of the random variables
Next, we apply Corollary 2.1 in Dik and Gunst [
We now establish that
Corollary 14.11.3 in Harville [
On the other hand, we know the following additional properties:
if is full rank (Corollary b.3.3 in Harville [ 31 ] (p. 83)).
-
if dimension of coincides with dimension of
Matrix
Therefore, we have
Appendix A.3. Rényi’s Pseudodistance between Normal Populations
Here, we compute the expression of the RP between densities belonging to the normal model with parameters
We first compute
In relation with
Now it is necessary to obtain A and
Then,
We have,
Therefore,
On the other hand,
However,
Therefore,
Then,
For
Appendix A.4. Computation of the Nonzero Eigenvalues of Aγ(θ0)Bτ(θ0)Kτ(θ0)Bτ(θ0)
We know that the matrix
Then,
Therefore,
On the other hand
Then,
Now we obtain the elements of that matrix,
Now we obtain the matrix
Then,
On the other hand
Therefore
Now we have,
-
The matrix
-
The matrix
-
The matrix
-
The matrix
References
1. Beran, R. Minimum Hellinger distance estimates for parametric models. Ann. Stat.; 1977; 5, pp. 445-463. [DOI: https://dx.doi.org/10.1214/aos/1176343842]
2. Tamura, R.N.; Boos, D.D. Minimum Hellinger distance estimation for multivariate location and covariance. J. Am. Stat. Assoc.; 1986; 81, pp. 223-229. [DOI: https://dx.doi.org/10.1080/01621459.1986.10478264]
3. Simpson, D.G. Minimum Hellinger distance estimation for the analysis of count data. J. Am. Stat. Assoc.; 1987; 82, pp. 802-807. [DOI: https://dx.doi.org/10.1080/01621459.1987.10478501]
4. Simpson, D.G. Hellinger deviance tests: Efficiency, breakdown points, and examples. J. Am. Stat. Assoc.; 1989; 84, pp. 107-113. [DOI: https://dx.doi.org/10.1080/01621459.1989.10478744]
5. Lindsay, B.G. Efficiency versus robustness: The case for minimum Hellinger distance and related methods. Ann. Stat.; 1994; 22, pp. 1081-1114. [DOI: https://dx.doi.org/10.1214/aos/1176325512]
6. Pardo, L. Statistical Inference Based on Divergence Measures; Chapman & Hall/CRC: Boca de Raton, FL, USA, 2006.
7. Basu, A.; Shioya, H.; Park, C. Statistical Inference: The minimum Distance Approach; Chapman & Hall/CRC Press: Boca de Raton, FL, USA, 2011.
8. Broniatowski, M.; Toma, A.; Vajda, I. Decomposable pseudodistances and applications in statistical estimation. J. Stat. Plan. Inference; 2012; 142, pp. 2574-2585. [DOI: https://dx.doi.org/10.1016/j.jspi.2012.03.019]
9. Castilla, E.; Jaenada, M.; Pardo, L. Estimation and testing on independent not identically distributed observations based on Rényi’s pseudodistances. IEEE Trans. Inf. Theory; 2022.in press [DOI: https://dx.doi.org/10.1109/TIT.2022.3158308]
10. Silvey, S.D. Reprinting, Monographs on Statistical Subjects; Chapman and Hall: London, UK, 1975.
11. Basu, A.; Mandal, A.; Martin, N.; Pardo, L. Testing Composite Hypothesis Based on the Density Power Divergence. Sankhya B Indian J. Stat.; 2018; 80, pp. 222-262. [DOI: https://dx.doi.org/10.1007/s13571-017-0143-0]
12. Ghosh, A. Influence function analysis of the restricted minimum divergence estimators: A general form. Electron. J. Stat.; 2015; 9, pp. 1017-1040. [DOI: https://dx.doi.org/10.1214/15-EJS1025]
13. Jones, M.C.; Hjort, N.L.; Harris, I.R.; Basu, A. A comparison of related density-based minimum divergence estimators. Biometrika; 2001; 88, pp. 865-873. [DOI: https://dx.doi.org/10.1093/biomet/88.3.865]
14. Fujisawa, H.; Eguchi, S. Robust parameter estimation with a small bias against heavy contamination. J. Multivariante Anal.; 2008; 99, pp. 2053-2081. [DOI: https://dx.doi.org/10.1016/j.jmva.2008.02.004]
15. Castilla, E.; Jaenada, M.; Martin, N.; Pardo, L. Robust approach for comparing two dependent normal populations through Wald-type tests based on Rényi’s pseudodistance estimators. arXiv; 2022; arXiv: 2202.00982
16. Toma, A.; Leoni-Aubin, S. Robust tests based on dual divergence estimators and saddlepoint approximations. J. Multivariante Anal.; 2010; 101, pp. 1143-1155. [DOI: https://dx.doi.org/10.1016/j.jmva.2009.11.001]
17. Toma, A.; Karagrigoriou, A.; Trentou, P. Robust model selection criteria based on pseudodistances. Entropy; 2020; 22, 304. [DOI: https://dx.doi.org/10.3390/e22030304] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33286078]
18. Castilla, E.; Martin, N.; Muñoz, S.; Pardo, L. Robust Wald-type tests based on Minimum Rényi Pseudodistance Estimators for the Multiple Regression Model. J. Stat. Comput. Simul.; 2020; 14, pp. 2592-2613. [DOI: https://dx.doi.org/10.1080/00949655.2020.1787410]
19. Castilla, E.; Ghosh, A.; Jaenada, M.; Pardo, L. On regularization methods based on Rényi’s pseudodistances for sparse high-dimensional linear regression models. arXiv; 2022; arXiv: 2202.00982
20. Jaenada, M.; Pardo, L. The minimum Renyi’s Pseudodistances estimators for Generalized Linear Models. Data Analysis and Related Applications: Theory and Practice; Proceeding of the ASMDA Wiley: Athens, Greece, 2021.
21. Jaenada, M.; Pardo, L. Robust Statistical Inference in Generalized Linear Models Based on Minimum Renyi’s Pseudodistance Estimators. Entropy; 2022; 24, 123. [DOI: https://dx.doi.org/10.3390/e24010123]
22. Hampel, F.R.; Ronchetti, E.; Rousseauw, P.J.; Stahel, W. Robust Statistics: The Approach Based on Influence Functions; John Wiley & Sons: Hoboken, NJ, USA, 1986.
23. Rao, C.R. Score test: Historical review and recent developments. Advances in Ranking and Selection, Multiple Comparisons, and Reliability; Birkhäuser: Boston, MA, USA, 2005; pp. 3-20.
24. Basu, A.; Ghosh, A.; Martin, N.; Pardo, L. A Robust Generalization of the Rao Test. J. Bus. Econ. Stat.; 2021; 40, pp. 868-879. [DOI: https://dx.doi.org/10.1080/07350015.2021.1876711]
25. Warwick, J.; Jones, M.C. Choosing a robustness tuning parameter. J. Stat. Comput. Simul.; 2005; 75, pp. 581-588. [DOI: https://dx.doi.org/10.1080/00949650412331299120]
26. Basak, S.; Basu, A.; Jones, M.C. On the optimal density power divergence tuning parameter. J. Appl. Stat.; 2021; 48, pp. 536-556. [DOI: https://dx.doi.org/10.1080/02664763.2020.1736524]
27. Welch, W.J. Rerandomizing the median in matched-pairs designs. Biometrika; 1987; 74, pp. 609-614. [DOI: https://dx.doi.org/10.1093/biomet/74.3.609]
28. Basu, A.; Mandal, A.; Martin, N.; Pardo, L. Testing statistical hypotheses based on the density power divergence. Ann. Inst. Stat. Math.; 2013; 65, pp. 319-348. [DOI: https://dx.doi.org/10.1007/s10463-012-0372-y]
29. Darwin, C. The Effects of Cross and Self Fertilisation in the Vegetable Kingdom; AMS Press Inc.: New York, NY, USA, 1877.
30. Dik, J.J.; de Gunst, M.C.M. The Distribution of General Quadratic Forms in Norma. Stat. Neerl.; 1985; 39, pp. 14-26. [DOI: https://dx.doi.org/10.1111/j.1467-9574.1985.tb01121.x]
31. Harville, D.A. Matrix Algebra from a Statistician’s Perspective; Springer: New York, NY, USA, 2008.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The Rao’s score, Wald and likelihood ratio tests are the most common procedures for testing hypotheses in parametric models. None of the three test statistics is uniformly superior to the other two in relation with the power function, and moreover, they are first-order equivalent and asymptotically optimal. Conversely, these three classical tests present serious robustness problems, as they are based on the maximum likelihood estimator, which is highly non-robust. To overcome this drawback, some test statistics have been introduced in the literature based on robust estimators, such as robust generalized Wald-type and Rao-type tests based on minimum divergence estimators. In this paper, restricted minimum Rényi’s pseudodistance estimators are defined, and their asymptotic distribution and influence function are derived. Further, robust Rao-type and divergence-based tests based on minimum Rényi’s pseudodistance and restricted minimum Rényi’s pseudodistance estimators are considered, and the asymptotic properties of the new families of tests statistics are obtained. Finally, the robustness of the proposed estimators and test statistics is empirically examined through a simulation study, and illustrative applications in real-life data are analyzed.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer