1. Introduction
Mudholkar and Hutson (2000) [1] studied an asymmetric normal distribution that they called the epsilon–skew–normal, $\{ESN(\varepsilon);\ |\varepsilon| < 1\}$, with asymmetry or skewness parameter $\varepsilon$. When the parameter $\varepsilon$ assumes the value 0, the distribution reduces to the standard normal distribution. The family thus consists of a parameterized set of usually asymmetric distributions that includes the symmetric standard normal density as a special case. Specifically, we say that $X \sim ESN(\varepsilon)$ if its density is of the form:
$$g(x;\varepsilon) = \phi\left(\frac{x}{1-\operatorname{sgn}(x)\,\varepsilon}\right),$$
where $|\varepsilon| < 1$, $\phi$ is the standard normal density and $\operatorname{sgn}(\cdot)$ is the sign function.
Arellano-Valle et al. (2005) [2] discuss extensions of this model, together with associated inference procedures. They consider a class of epsilon-skew-symmetric distributions associated with a particular symmetric density $f$ that is indexed by an asymmetry parameter $\varepsilon$, with densities given by
$$h(x;\varepsilon) = f\left(\frac{x}{1-\operatorname{sgn}(x)\,\varepsilon}\right), \tag{1}$$
where $x \in \mathbb{R}$ and $|\varepsilon| < 1$. If X has a density of the form (1), then we say that X is an epsilon-skew-symmetric random variable. Arellano-Valle et al. (2005) [2] extend this family to the epsilon-skew-exponential-power model, a model that can exhibit greater or lesser asymmetry and kurtosis than the ESN model. On the other hand, Gómez et al. (2007) [3] study the Fisher information matrix for the epsilon-skew-t model, which had previously been used in the study of a financial series by Hansen (1994) [4]; see also Gómez et al. (2008) [5]. Note that if in (1) we set $f$ equal to the standard Cauchy density, we obtain the epsilon–skew–Cauchy model.
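The epsilon–skew–normal density is simple to evaluate numerically. The following is a minimal sketch, assuming the Mudholkar–Hutson form $\phi(x/(1-\operatorname{sgn}(x)\varepsilon))$ with $|\varepsilon| < 1$; the function names `esn_pdf` and `integrate` are illustrative and do not come from the paper. It checks that the density integrates to 1 and that the mass on the negative half-line is $(1+\varepsilon)/2$.

```python
import math

def std_normal_pdf(x):
    """Standard normal density phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def esn_pdf(x, eps):
    """Assumed ESN density: phi(x / (1 - sgn(x) * eps)), with |eps| < 1."""
    if not -1.0 < eps < 1.0:
        raise ValueError("eps must lie strictly between -1 and 1")
    # For x < 0 the scale is 1 + eps; for x >= 0 it is 1 - eps.
    scale = 1.0 + eps if x < 0 else 1.0 - eps
    return std_normal_pdf(x / scale)

def integrate(f, a, b, n=20000):
    """Composite trapezoidal rule on [a, b] with n sub-intervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h

eps = 0.5
total_mass = integrate(lambda x: esn_pdf(x, eps), -20.0, 20.0)
mass_below_zero = integrate(lambda x: esn_pdf(x, eps), -20.0, 0.0)
print(total_mass, mass_below_zero)
```

Under these assumptions the two printed values should be close to 1 and to $(1+\varepsilon)/2 = 0.75$, respectively, which illustrates how $\varepsilon$ shifts probability mass between the two half-lines.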
We will write $X \sim N(0,1)$ to indicate that X has a standard normal distribution, and we will write $Y \sim HN(0,1)$ to indicate that Y has a standard half-normal distribution, i.e., that $Y = |Z|$ where $Z \sim N(0,1)$.
The distribution of the ratio of two random variables is of interest in problems in biological and physical sciences, econometrics, and ranking and selection. It is well known that if $X \sim N(0,1)$ and $Y \sim N(0,1)$ are independent, then the random variables $X/Y$ and $X/|Y|$ both have Cauchy distributions; see Johnson et al. (1994, [6] Chapter 16). Behboodian et al. (2006) [7] and Huang and Chen (2007) [8] study the distribution of such quotients when the component random variables are skew-normal (of the form studied in Azzalini (1985) [9]). The principal objective of the present paper is to study the behavior of such quotients when the component random variables have epsilon-skew-normal distributions.
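The classical fact that a ratio of independent standard normal variables is Cauchy can be checked by simulation. The sketch below assumes the two quotients in question are $X/Y$ and $X/|Y|$; the function name `cauchy_ratio_check`, the sample size and the seed are illustrative choices. It estimates $P(Z \le 1)$, which equals $1/2 + \arctan(1)/\pi = 0.75$ for a standard Cauchy variable.

```python
import random

def cauchy_ratio_check(n=200_000, seed=0):
    """Estimate P(Z <= 1) for Z = X/Y and Z = X/|Y| with X, Y i.i.d. N(0, 1)."""
    rng = random.Random(seed)
    below_ratio = 0  # counts draws with X / Y <= 1
    below_half = 0   # counts draws with X / |Y| <= 1
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        y = rng.gauss(0.0, 1.0)
        if x / y <= 1.0:
            below_ratio += 1
        if x / abs(y) <= 1.0:
            below_half += 1
    return below_ratio / n, below_half / n

p_ratio, p_half = cauchy_ratio_check()
# Both estimates should be close to the standard Cauchy value 0.75.
print(p_ratio, p_half)
```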
The paper is organized in the following manner. In Section 2, we describe a representation of the epsilon–skew–Cauchy model. In Section 3, we consider the distribution of the ratio of two independent random variables, one of which has an $ESN(\varepsilon)$ distribution and the other a standard normal distribution. In addition, an extension of the epsilon–skew–Cauchy (ESC) distribution is introduced. Bivariate versions of these distributions are discussed in Section 4. Extensions to higher dimensions can be readily envisioned, but are not discussed here. In Section 5, some of the bivariate distributions introduced in this paper are considered as possible models for a particular real-world data set.
2. Representation of the ESC (Epsilon–Skew–Cauchy) Model
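Proposition 1. If $X \sim ESN(\varepsilon)$ and $Y \sim HN(0,1)$ (standard half-normal) are independent random variables, then $Z = X/Y$ has an epsilon–skew–Cauchy distribution with asymmetry parameter ε and density given by
$$f_Z(z;\varepsilon) = \frac{1}{\pi\left[1 + \left(\dfrac{z}{1-\operatorname{sgn}(z)\,\varepsilon}\right)^{2}\right]},$$
where $|\varepsilon| < 1$, and we write $Z \sim ESC(\varepsilon)$.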
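Proof. With the transformation $Z = X/Y$ and $W = Y$, whose Jacobian is $|J| = w$, we obtain
$$f_{Z,W}(z,w) = 2w\,\phi\left(\frac{zw}{1-\operatorname{sgn}(z)\,\varepsilon}\right)\phi(w),$$
where $z \in \mathbb{R}$ and $w > 0$. It follows directly that
$$f_Z(z;\varepsilon) = \int_0^\infty f_{Z,W}(z,w)\,dw = \frac{1}{\pi}\int_0^\infty w \exp\left\{-\frac{w^2}{2}\left[1 + \frac{z^2}{(1-\operatorname{sgn}(z)\,\varepsilon)^2}\right]\right\}dw = \frac{1}{\pi\left[1 + \left(\dfrac{z}{1-\operatorname{sgn}(z)\,\varepsilon}\right)^{2}\right]}.$$
☐
Figure 1 depicts the behavior of the ESC density for a variety of parameter values.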
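The stochastic representation also gives a direct way to simulate ESC variates. The sketch below rests on assumptions that should be checked against the paper's own definitions: it takes the ESC variable to be the ratio of an $ESN(\varepsilon)$ variable to an independent standard half-normal variable, it draws the $ESN(\varepsilon)$ variable as $-(1+\varepsilon)|N|$ with probability $(1+\varepsilon)/2$ and $(1-\varepsilon)|N|$ otherwise, and it uses the distribution function obtained by integrating the density $1/\{\pi[1 + z^2/(1-\operatorname{sgn}(z)\varepsilon)^2]\}$. The names `sample_esn`, `sample_esc` and `esc_cdf` are illustrative.

```python
import math
import random

def sample_esn(eps, rng):
    """Draw from ESN(eps) as a two-piece scaled half-normal (assumed form)."""
    half_normal = abs(rng.gauss(0.0, 1.0))
    if rng.random() < (1.0 + eps) / 2.0:
        return -(1.0 + eps) * half_normal  # negative branch, scale 1 + eps
    return (1.0 - eps) * half_normal       # positive branch, scale 1 - eps

def sample_esc(eps, rng):
    """Draw Z = X / Y with X ~ ESN(eps) and Y standard half-normal."""
    x = sample_esn(eps, rng)
    y = abs(rng.gauss(0.0, 1.0))
    return x / y

def esc_cdf(z, eps):
    """Distribution function implied by the assumed ESC density."""
    if z < 0:
        return (1.0 + eps) * (0.5 + math.atan(z / (1.0 + eps)) / math.pi)
    return (1.0 + eps) / 2.0 + (1.0 - eps) * math.atan(z / (1.0 - eps)) / math.pi

eps = 0.5
rng = random.Random(0)
draws = [sample_esc(eps, rng) for _ in range(200_000)]
# Largest gap between the empirical and theoretical CDF at a few points.
max_gap = max(
    abs(sum(d <= z for d in draws) / len(draws) - esc_cdf(z, eps))
    for z in (-2.0, -0.5, 0.0, 1.0, 3.0)
)
print(max_gap)
```

A small value of `max_gap` is consistent with the ratio representation; setting `eps = 0.0` reduces `esc_cdf` to the standard Cauchy distribution function.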
3. Generalized Cauchy Distribution
Proposition 2. If and are independent random variables, then has what we call a generalized epsilon–skew–Cauchy (GESC) distribution with parameters and and with density given by
where , , and we write .
Proof. Arguing as in Proposition 1, we have that
By considering separately the cases in which and , the result follows directly. ☐
From Proposition 2, two special cases are obtained directly:
- If a generalized Cauchy (GC) distribution is obtained. In this case we write , and its pdf is given by
  where , and where and are independent random variables.
- If an epsilon–skew–Cauchy distribution is obtained. In this case we write , and its pdf is given by
  where , and where and are independent random variables.
4. General Bivariate Mudholkar-Hutson Distribution
Define to be i.i.d. standard normal random variables. For define
where , , , , , are positive numbers and . So, the parameters and indicate the propensity with which the discrete random variable takes negative and positive values, respectively. The parameters control how often negative and positive values are taken by . In addition, assume that all six random variables and are independent, and define
(2)
The model (2) is highly flexible since it allows for different behavior in each of the four quadrants of the plane. From (2) it may be recognized that only fractional moments of X and Y exist.
Note that if we define , it is readily verified that
(3)
in which case we say that has a standard bivariate half-Cauchy distribution. Using (3) and conditioning on and , we obtain the density of in the following, albeit complicated, form:
(4)
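The half-Cauchy building block behind (3) and (4) can be checked by simulation. The sketch below assumes that the standard bivariate half-Cauchy pair can be generated as $(|Z_1|/|Z_3|,\ |Z_2|/|Z_3|)$ from i.i.d. standard normal variables, and that its density on the positive quadrant is $(2/\pi)(1+w_1^2+w_2^2)^{-3/2}$. Under that assumption the joint distribution function is $(2/\pi)\arctan\{ab/\sqrt{1+a^2+b^2}\}$. The names `half_cauchy_pair` and `joint_cdf` are illustrative.

```python
import math
import random

def half_cauchy_pair(rng):
    """Draw (|Z1|/|Z3|, |Z2|/|Z3|) from three i.i.d. N(0, 1) variables."""
    z1 = abs(rng.gauss(0.0, 1.0))
    z2 = abs(rng.gauss(0.0, 1.0))
    z3 = abs(rng.gauss(0.0, 1.0))
    return z1 / z3, z2 / z3

def joint_cdf(a, b):
    """P(W1 <= a, W2 <= b) implied by the assumed bivariate half-Cauchy density."""
    return (2.0 / math.pi) * math.atan(a * b / math.sqrt(1.0 + a * a + b * b))

rng = random.Random(0)
pairs = [half_cauchy_pair(rng) for _ in range(200_000)]
# Compare empirical and theoretical joint probabilities at a few points.
max_gap_biv = max(
    abs(sum(w1 <= a and w2 <= b for w1, w2 in pairs) / len(pairs) - joint_cdf(a, b))
    for a, b in ((1.0, 1.0), (0.5, 2.0), (3.0, 3.0))
)
print(joint_cdf(1.0, 1.0), max_gap_biv)
```

At $(a,b) = (1,1)$ the formula gives exactly $1/3$, which matches the symmetry argument that $|Z_3|$ is the largest of the three absolute values with probability $1/3$. Letting $b \to \infty$ recovers the half-Cauchy marginal $(2/\pi)\arctan(a)$.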
Some special cases which might be considered include the following:
1. Mudholkar and Hutson type. For this we set: , and for .
In this case the density (4) simplifies somewhat to become:
(5)
A further specialization of the density (5) can be considered, as follows.
2. Homogeneous Mudholkar and Hutson type. For this we set: , and for .
This homogeneity results in a slight simplification of (4), thus:
(6)
It is easy to see that the parameter is not identifiable in (6) because . An adjustment to ensure identifiability involves introducing the constraint .
3. Equal weights. In this case we assume that are positive numbers and for .
(7)
The pdf (7) is not identifiable because the values of can be interchanged with those of and does not change. Moreover, multiplying all of the ’s and ’s by a constant does not change . So, one way to get identifiability in the model (7) is to set and equal to 1. In that case, Equation (7) takes the form
However, this is then recognizable as being simply a scaled version of the standard bivariate Cauchy density (compare with Equation (3)).
We now consider the marginal densities for the random variable defined by (2). From (2) we have and and the density of is a standard half-Cauchy density, i.e., .
Consequently, the density of X is of the form
which is a mixture of half-Cauchy densities. The density of Y is then of the same form with replaced by .
To get a more direct bivariate version of the original Mudholkar-Hutson density in which the ’s, ’s and ’s were related to each other, we go back to the original representation, i.e., Equation (2), but now we set , , , , , and . So the density is of the form
with marginal density
A more general version of the density with is of the form (without loss of generality set ):
with corresponding marginal
Needless to say, we can consider analogous models in which, instead of assuming that has a density of the form (3), we assume that it has another bivariate density with support , e.g., bivariate normal restricted to , or bivariate Pareto, etc.
Another general bivariate MH model, which includes the bivariate skew-Cauchy distribution given by (2), is the bivariate skew t model. This model can be obtained by replacing by in Equation (2). Thus, if we assume that all six random variables , ; and are independent, and define
(8)
then, because has density the density of is
(9)
The pair of variables in (8) allows one to model a wider variety of paired data sets than that given in (2) because it can also model light and heavy “tails” in a different way for each quadrant of the coordinate plane. Additionally, it is possible to compute the order moments.
Proposition 3. The expected value of in (8) is given by
provided that .
Proof. For it is immediate that and .
On the other hand, since then
Therefore, from (8) we have
and the result follows directly. ☐
Following the proof of the previous proposition, it is possible to obtain the order moments provided that .
In applications, it will usually be appropriate to augment these models by the introduction of location, scale and rotation parameters, i.e., to consider
where and is positive definite.
5. Application
The data that we will use were collected by the Australian Institute of Sport and reported by Cook and Weisberg (1994) [10]. The data set consists of values of several variables measured on Australian athletes. Specifically, we shall consider the pair of variables (Ht, Wt), which are the height (cm) and the weight (kg) measured for each athlete.
We fitted the bivariate Mudholkar-Hutson distributions for five different cases. In addition to the general case which is given by the pdf , based on (9), where , and is the location parameter and
is the symmetric positive definite scale matrix, we also consider four special cases:
- When the pdf is given by Equation (4). That is, taking in the bivariate skew t MH model.
- When the pdf is given by
  which is the bivariate MH distribution specified using the special case (1.), where for all .
- When the pdf is given by
  which is the bivariate MH distribution specified using the special case (2.), where .
- When the pdf is given by
  which is the bivariate Cauchy distribution (as in (3)).
All fits were obtained by maximizing the likelihood numerically, combining algorithms based on the Hessian matrix with the simulated annealing algorithm. Standard errors of the estimates were computed from 1000 bootstrap samples of the data.
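The bootstrap step can be sketched generically. The example below is an illustration under stated assumptions rather than the fitting code used for the tables: it uses a nonparametric bootstrap with 1000 resamples, and it replaces the maximum likelihood fit by a stand-in estimator (the sample median of a simulated univariate Cauchy sample). The function name `bootstrap_se` and the simulated data are hypothetical.

```python
import random
import statistics

def bootstrap_se(data, estimator, n_boot=1000, seed=0):
    """Bootstrap standard error: SD of the estimator over resampled data sets."""
    rng = random.Random(seed)
    n = len(data)
    replicates = []
    for _ in range(n_boot):
        resample = rng.choices(data, k=n)  # draw n observations with replacement
        replicates.append(estimator(resample))
    return statistics.stdev(replicates)

# Hypothetical data: 200 standard Cauchy draws, generated as ratios of normals.
data_rng = random.Random(1)
sample = [data_rng.gauss(0.0, 1.0) / data_rng.gauss(0.0, 1.0) for _ in range(200)]

# Stand-in estimator; in the paper's setting this would be the numerical MLE.
se_median = bootstrap_se(sample, statistics.median)
print(se_median)
```

In the setting of this section, `estimator` would refit the chosen bivariate model to each resampled set of (Ht, Wt) pairs and return one parameter estimate, so that the spread of the 1000 refits approximates the standard error of that estimate.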
To compare model fits, we used the Akaike information criterion (see Akaike, 1974 [11]), namely
$$AIC = -2\,\ell(\hat{\theta}) + 2k,$$
where $\ell(\hat{\theta})$ is the maximized log-likelihood and k is the dimension of $\theta$, which is the vector of parameters of the model being considered.
Table 1 displays the results of fitting the five competing models to the data for the 100 women. The models are the general bivariate skew MH model with pdf given in Equation (9), General bivariate MH with pdf given in Equation (4), Bivariate MH (1.) with pdf given in Equation (5), Bivariate MH (2.) with pdf given in Equation (6) and Bivariate Cauchy. The table shows the maximum likelihood estimates (mle’s) for the five models. The last column shows the estimated standard errors (se) of the estimates. Non-identifiability of the bivariate skew MH model is evidenced by the huge value of the estimated standard error of the estimate of . However, the AIC indicates that the data are best fitted by the general bivariate skew MH (9) model. Figure 2 shows the contour lines of the fitted pdf.
Table 2 displays the results of the fits for the 102 men, again comparing the five competing models. The maximum likelihood estimates (mle’s) for the five models, the corresponding Akaike criterion values and the estimated standard errors of the estimates are displayed in the table. Again, the AIC indicates that the data are best fitted by the general bivariate skew MH (9) model. Figure 3 shows the contour lines of the fitted pdf. Non-identifiability of the general bivariate skew MH (9) and General bivariate MH (4) models is shown by the large values of the estimated standard errors for some estimates.
Table 3 displays the results of the fits for the full set of Australian athletes, regardless of gender. Table 3 shows the maximum likelihood estimates (mle’s) for the five models together with the corresponding Akaike criterion values and estimated standard errors of the estimates. The AIC here also indicates that the data are best fitted by the general bivariate skew MH model (9). Figure 4 shows the contour lines of the fitted pdf. Thus, in all three cases (males, females and combined), the best-fitting model was the general bivariate skew MH.
For all three data sets analyzed, the general bivariate skew-t MH model (9) was selected as the best-fitting model. This suggests that the more general model is worth considering to explain the variability of these data sets.
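The AIC comparison used throughout this section amounts to a small calculation once each model's maximized log-likelihood and parameter count are known. The sketch below assumes the standard form $AIC = -2\ell(\hat{\theta}) + 2k$; the model names and log-likelihood values are made-up placeholders, not the values reported in Tables 1 to 3.

```python
def aic(log_likelihood, n_params):
    """Akaike information criterion: -2 * log-likelihood + 2 * number of parameters."""
    return -2.0 * log_likelihood + 2.0 * n_params

def rank_by_aic(fits):
    """Sort models by AIC, smallest (preferred) first.

    `fits` maps a model name to (maximized log-likelihood, number of parameters).
    """
    scored = [(name, aic(loglik, k)) for name, (loglik, k) in fits.items()]
    return sorted(scored, key=lambda item: item[1])

# Hypothetical placeholder values, for illustration only.
fits = {
    "model_A": (-120.0, 5),
    "model_B": (-118.5, 9),
}
ranking = rank_by_aic(fits)
print(ranking)
```

In this made-up example the richer model attains a higher log-likelihood but is penalized for its four extra parameters, so the smaller model is preferred. The same trade-off governs the comparison of the five nested bivariate models.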
6. Concluding Remarks
The Mudholkar–Hutson skewing mechanism admits flexible extensions in both the univariate and multivariate cases. Stochastic representations of such extended models typically have corresponding likelihood functions that are somewhat complicated (for example, Equations (4) and (9)). This is partly compensated for by the ease of simulation for such models using the representation in terms of latent variables (the ’s and ’s in (2) or (8)). The data set analyzed in Section 5 illustrates the potential advantage of considering the extended bivariate Mudholkar-Hutson model, since the basic bivariate M-H model does not provide an acceptable fit to the “athletes” data. Needless to say, applying Occam’s razor, it would always be desirable to consider the hierarchy of the five nested bivariate models that were considered in Section 5, in order to determine whether one of the simpler models might be adequate to describe a particular data set. Indeed, there may be other special cases of the model (4), intermediate between (4) and (5) say, that might profitably be considered for some data sets. However, it should be remarked that selection of such sub-models for consideration should be done prior to inspecting the data. The addition of the GBMH model to the data analyst’s tool-kit should provide desirable flexibility for modeling data sets which exhibit behavior somewhat akin to, but not perfectly adapted to, more standard bivariate Cauchy models.
Author Contributions
All the authors contributed significantly to this research article.
Funding
This research received no external funding.
Acknowledgments
The research of H. Varela and H.W. Gómez was supported by Grant SEMILLERO UA-2019 (Chile).
Conflicts of Interest
The authors declare no conflict of interest.
Figures and Tables
Figure 1. Examples of the ESC(ε) density for: ε=0 (green line), ε=−0.5 (blue line), ε=−0.8 (black line), ε=0.5 (red line) and ε=0.8 (pink line).
Figure 2. Female Australian athletes data: scatter plot (Ht, Wt) and fitted General bivariate MH.
Figure 3. Male Australian athletes data: scatter plot (Ht, Wt) and fitted General bivariate MH.
Figure 4. Australian athletes data: scatter plot (Ht, Wt) and fitted General bivariate MH. Red points denote men and + signs denote women.
Table 1. Bivariate Mudholkar-Hutson fits for women.
Model | AIC | Estimates | Bootstrap Standard Errors |
---|---|---|---|
Bivariate skew-t MH (9) | | | |
General bivariate MH (4) | | | |
Bivariate MH (1.) | | | |
Bivariate MH (2.) | | | |
Bivariate Cauchy | | | |
Table 2. Bivariate Mudholkar-Hutson fits for men.
Model | AIC | Estimates | Bootstrap Standard Errors |
---|---|---|---|
Bivariate skew-t MH (9) | | | |
General bivariate MH (4) | | | |
Bivariate MH (1.) | | | |
Bivariate MH (2.) | | | |
Bivariate Cauchy | | | |
Table 3. Bivariate Mudholkar-Hutson fits.
Model | AIC | Estimates | Bootstrap Standard Errors |
---|---|---|---|
Bivariate skew-t MH (9) | | | |
General bivariate MH (4) | | | |
Bivariate MH (1.) | | | |
Bivariate MH (2.) | | | |
Bivariate Cauchy | | | |
© 2019 by the authors.
Abstract
In this paper, we consider a stochastic representation of the epsilon–skew–Cauchy distribution, viewed as a member of the family of skewed distributions discussed in Arellano-Valle et al. (2005). The stochastic representation facilitates derivation of distributional properties of the model. In addition, we introduce symmetric and asymmetric extensions of the Cauchy distribution, together with an extension of the epsilon–skew–Cauchy distribution. Multivariate versions of these distributions can be envisioned. Bivariate examples are discussed in some detail.
1 Statistics Department, University of California Riverside, Riverside, CA 92521, USA
2 Departamento de Matemática, Universidad de Antofagasta, Antofagasta 1240000, Chile
3 Instituto de Matemática y Física, Universidad de Talca, Casilla 747, Talca, Chile