1. Introduction
The use of multivariate normal (MN) distribution plays a central role in statistical modeling. However, there are some situations where the data are not in agreement with the MN distribution. Departure from normality can take place in different ways, such as multimodality, lack in central symmetry, and positive or negative excess of kurtosis. The class of scale mixtures of skew-normal distributions (SMSN) whose general form was first introduced by Branco and Dey [1] includes many multivariate skew symmetric (MSS) distributions with only one mode.
More formally, a scale mixture distribution can be constituted by mixing a base density over a scaling distribution. Its density can be expressed in the form of the integral given by
(1)
where is the conditional density of a random vector given . Herein, is a positive function of a scaling variable with cumulative distribution function (cdf) , indexed by the parameter vector .Using (1), the family of SMSN distributions can be generated by assuming a multivariate skew-normal (MSN) distribution [2] with location , scale covariance matrix , and shape parameter for . The marginal probability density function (pdf) of can be obtained as follows:
(2)
where denotes the pdf of the p-variate MN distribution with mean vector and covariance matrix , and is the cdf of the standard normal distribution. Some simple scaling functions such as and lead to well-known distributions. As described by Branco and Dey [1], some remarkable examples are the multivariate skew-t (MST), multivariate skew-slash (MSSL), and multivariate skew-contaminated-normal (MSCN) distributions, to name just a few. The SMSN family collapses to the class of scale mixture of normal distributions when the skewness parameter vanishes. The MSN distribution [2] with the parameterization given by Arellano-Valle and Genton [3] can be recovered when is degenerated by imposing .The multivariate skew-scale mixtures of normal (MSSMN) distributions [4] is established when defined in (1) is given by
(3)
Therefore, the pdf of MSSMN distribution is obtained as follows:(4)
Recently, Arellano-Valle et al. [5] proposed a multivariate class of scale-shape mixtures of skew-normal (MSSMSN) distributions which provides alternative candidates for modeling asymmetric data. A convenient hierarchical representation of the MSSMSN distribution is given by
(5)
where is a mixing vector with a joint cdf , and is a real-valued shape mixing function which is not necessarily symmetric about zero. The family of MSSMSN distributions encapsulates several renowned unimodal asymmetric distributions generated by varying the scale and shape functions, and , for a given distribution of , or alternatively by fixing and but varying the distribution of . A convenient setup for the mixing functions is to choose and , leading to a particular form of the shape mixture of SMSN distributions. If we choose , it produces another form of shape mixtures of MSSMN distributions. See [5] for a more comprehensive discussion and detailed investigation.As shown by Azzalini and Capitanio [6], the pdf of the MSS distribution can be written as follows:
(6)
where and are, respectively, the location vector and scale covariance matrix; is a p-variate centrally symmetric pdf with respect to the origin, i.e., ; is a univariate cdf satisfying ; and is an odd real-valued function, namely . The MSS class is equivalent to the one studied by Wang et al. [7] for which is replaced by satisfying .In light of (6), it is possible to generate a wide range of asymmetric families of unimodal and multimodal skew distributions depending on the specification of the function . This essential property induces greater flexibility in the available shapes. For example, Ma and Genton [8] proposed a flexible class of skew-symmetric distributions by choosing , where is an odd polynomial function of order K.
The Hadamard product, also known as the Schur product, is a type of matrix multiplication that is commutative and simpler than the matrix product. See [9] for a comprehensive review and its applications to multivariate statistical analysis. The Hadamard product is advantageous in computations and algebraic manipulations because the products are entry-wise, the multiplication is commutative, and, particularly, the inverse is very easy to obtain and the computation of power matrices is straightforward. By convention, we use “⊙" to denote the Hadamard operations (product and power). Let and be two matrices of the same dimension but not necessarily square. Then, the Hadamard product between these two matrices, denoted by , is the element-wise product of and , that is, a matrix whose entry is . Accordingly, the nth Hadamard power of matrix is defined as . More key properties concerning the multiplication and partial derivatives of the Hadamard product are deferred to Appendix A.
In the multivariate setup, the odd polynomial of order K has various combinations of the coefficients. For example, the bivariate case under an odd polynomial of order is given by . As an alternative to that of Ma and Genton [8] in more a general setting, we introduce the multivariate flexible skew-symmetric-normal (MFSSN) distribution, denoted by , which has the following pdf:
(7)
where is a multiple-scaled vector of shape parameters and remains a vector through an odd Hadamard power transformation of order , for . Notably, the MFSSN distribution encompasses the flexible generalized skew-normal (FGSN) distribution introduced by Ma and Genton [8] as the univariate case. Figure 1 illustrates the scatter-contour plots coupled with their marginal histograms of the bivariate MFSSN distribution under , and various specifications of shape parameters arisen from two setups of m. As can be seen, many different non-elliptically distributional shapes with multiple modes and asymmetric patterns can be produced. Additional flexibility can be gained by expanding the thickness of tails. This motivates us to propose a more general class of scale-shape mixtures of MFSSN (SSMFSSN) distributions.The class of SSMFSSN distributions can be hierarchically represented by
(8)
where . Further, we denote the pdf of by . From (8), the marginal pdf of is given by(9)
Obviously, the MFSSN model is obtained by setting in (9).The family of SSMFSSN distributions introduced in (8) is quite vast, containing several subfamilies of asymmetric and multimodal distributions which have never been considered in the literature. Notice that the MSSMSN distribution described in (5) is a simple case of SSMFSSN by taking , , and , for . More importantly, the scale-shape mixtures of flexible generalized skew-normal (SSMFGSN) distributions proposed very recently by Mahdavi et al. [10] can be thought of as a univariate case of SSMFSSN when the dimension .
The EM algorithm [11] and some of its extraordinary variants such as the expectation conditional maximization (ECM) algorithm [12] and the expectation conditional maximization either (ECME) algorithm [13] are broadly applicable methods to carry out ML estimation for multivariate skew distributions in a complete-data framework. To the best of our knowledge, previous developments of the EM-type algorithm are based on the convolution-type representations, see, e.g., [14] for the MSN distribution, [15] for the MST distribution, [1] for the SMSN distribution, and [5] for the MSSMSN distribution. Since our proposed SSMFSSN model cannot be explicitly expressed by a convolution-type representation, we develop a novel EM-based procedure designed under the selection mechanism to compute the ML estimates.
The rest of the paper is organized as follows. Section 2 presents the formulation of the general SSMFSSN model and discusses how to deploy the ECME algorithm for ML estimation based on the selection-type mechanism. Section 3 exemplifies some particular cases of SSMFSSN distributions constructed by setting different mixing distributions for . The proposed techniques are illustrated by conducting two simulation studies in Section 4 and analyzing a real data example in Section 5. We conclude in Section 6 with a few remaks and offer directions for future research.
2. Methodology
2.1. The Family of SSMFSSN Distributions
A p-variate random vector ∼ is asserted to follow the SSMFSSN distribution with location vector , scale covariance matrix , shape parameters , and flatness parameters if it has the following selection-type representation:
(10)
where , and ∼. Herein, ‘’ stands for equality in distribution, and U is obviously a continuous random variable symmetric about zero. Using this characterization, the random samples for can be simulated through the following scheme(11)
As a result, the random samples of the general can be obtained by the affine transformation .
For fitting the SSMFSSN model (10) within the complete-data framework via the EM-type algorithm, we introduce two latent variables and . Then, has the following joint pdf:
(12)
where , is the pdf of evaluated at point .Integrating out W from (12) gives the following joint pdf
(13)
Therefore, the marginal pdf of is given by
(14)
where the second equality holds if we further assume and are mutually independent. It is noteworthy that the shape mixtures of MSSMN distribution established by Arellano-Valle et al. [5] also belongs to the family of our proposed SSMFSSN model by imposing .Dividing (12) by (13) yields the following relation
(15)
for which ‘≡’ in (15) means that W and are conditionally independent. Moreover, it is straightforward to show that(16)
where represents a doubly truncated normal distribution confined within the interval , and is an indicator function of set . Using Lemma 2 of Lin et al. [16], we have the following conditional expectation:(17)
By Bayes’ rule, the conditional pdf of given is
(18)
2.2. Parameter Estimation via the ECME Algorithm
The EM algorithm [11] is a widely used iterative technique to deal with ML estimation in models that involve incomplete data or latent variables. One primary virtue of EM lies in the fact of attractive monotone convergence properties and the preservation of simplicity and stability. In practice, a major limitation of EM is often that some estimators in the M-step cannot be solved in terms of closed-form expressions. To overcome this obstacle, the ECM algorithm proposed by Meng and Rubin [12] recommends replacing the E-step of EM with a sequence of simpler conditional maximization (CM) steps, yet it also enjoys the same convergence properties as EM. However, in certain problems, some of the CM-steps of ECM may become analytically intractable or suffer from slow convergence. As a further flexible extension, the ECME algorithm [13] divides the CM-steps of ECM to maximize either the Q-function, called the CMQ-step, or the corresponding constrained actual log-likelihood function, named as the CML-step. In what follows, we describe in greater detail how the proposed SSMFSSN model can be fitted by using the ECME algorithm.
Suppose that constitutes a set of p-dimensional observed samples of size n arising from the SSMFSSN model. Under the EM framework, the latent variables and introduced in the preceding subsection are treated as missing data. Then, the complete data are given by . Further, we let denote the entire unknown parameters to be estimated in the SSMFSSN model, where is the half-vectorization operator that stacks the lower triangular elements of a symmetric matrix into a single vector.
According to (12), the log-likelihood function of corresponding to the complete-data , excluding additive constants and terms that do not involve parameters of the model, is given by
(19)
with where is a vector of ones, , is a diagonal matrix containing the elements of on the main diagonal, and is a reparameterized matrix.On the kth iteration, the E-step requires the calculation of the so-called Q-function, which is the conditional expectation of (19) given the observed data and the current estimate , where the superscript denote the updated estimates at iteration k. To evaluate the Q-function, we require the following conditional expectations:
(20)
which have explicit expressions that are discussed in detail in a subsequent section along with(21)
which may not have standard form for some subfamilies. Substituting (20) and (21) into (19) yields the following Q-function:(22)
The CM-steps are implemented to update estimates of in the order of , , and by maximizing, one by one, the Q-function obtained in the E-step. After some algebraic manipulations, they are summarized by the following CMQ and CML steps:
CMQ-Step 1: Fixing and , we update via Proposition A2 by taking the partial derivative of (22) with respect to . Since the derivation cannot get a closed-form expression for its maximizer, the solution of is validated by numerically solving the root of the following equation:
(23)
where the two terms and are nonlinear functions of with .CMQ-Step 2: Fixing and then updating by maximizing (22) over gives
(24)
CMQ-Step 3: Fixing , we update via Proposition A3 by taking the partial derivative of (22) with respect to each, . Since their solutions cannot be isolated and set equal to zeros, we have the following equation for finding the nonlinear roots of :
(25)
where is a nonlinear function of . After simplification, we can transform back to(26)
Collecting the above solutions turn out to be .
For some members of SSMFSSN, the calculation of is not straightforward. An update of can be achieved by directly maximizing the constrained actual log-likelihood function. This gives rise to the following CML-step:
CML-Step: is updated by optimizing the following constrained log-likelihood function:
(27)
Note that the maximization in the above CML-step requires p-dimensional search of the objective function (constrained log-likelihood), which can be easily accomplished by using, for example, the
On the initialization of parameters for starting the algorithm, the location vector and the scale covariance matrix are specified as the sample mean vector and sample covariance matrix, respectively. The initial values for the shape parameters are taken as the sample skewness of p variables, while the remaining are fixed around 0. As for , their initial values are chosen as relatively small values depending on the settings of parameter domain. To avoid getting stuck in one of the many local maxima of the likelihood function, a convenient method is to try a variety different of initial values with perturbations or using the bootstrap resampling method [14]. The solution with the highest log-likelihood value is treated as the ML estimates, denoted by .
3. Examples of SSMFSSN Distributions
We present some special cases of SSMFSSN distributions which are induced by setting different mixing distributions for . For each case, additional conditional expectations are also derived for the implementation of ECME.
3.1. The Multivariate Flexible Skew-Symmetric-t-Normal Distribution
The multivariate flexible skew-symmetric-t-normal (MFSSTN) distribution, denoted by ∼, is produced by taking and in (10). In this case, , where represents the pdf of the gamma distribution with mean . The pdf of the MFSSTN distribution can be expressed as
(28)
where stands for the pdf of p-dimensional multivariate t distribution with location vector , scale covariance matrix , and degree of freedom (DOF) equal to . One special case of the MFSSTN distribution is the multivariate skew-t-normal (MSTN) distribution of Lin et al. [18], obtained by letting for . In addition, the MFSSTN distribution reduces to MFSSN as .According to (18), it is easy to observe that ∼, where denotes the squared Mahalanobis distance. Therefore,
(29)
where is the digamma function.From (20) and (21), the E-step involves the calculation of , , and , which can be easily evaluated via the results of (17) and (29). As an alternative way of estimating , the CML-Step for the MFSSTN distribution can be altered to the following CMQ-Step.
-
CMQ-Step 4: is obtained by solving the root of the following equation:
(30)
3.2. The Multivariate Flexible Skew-Symmetric-Slash-Normal Distribution
Let be a p-dimensional random vector with the following representation
(31)
where ∼ and is independent of ∼, where U is the uniform distribution on the interval . From (31), the conditional distribution given is . Then, has a multivariate slash distribution with pdf given by(32)
where denotes the cdf of the gamma distribution with scale parameter 1 and shape parameter r. Using the law of iterated expectations, the mean vector and variance-covariance matrix of are(33)
If we select ∼ and for (10), this generates the multivariate flexible skew-symmetric-slash-normal (MFSSSLN) distribution, denoted by ∼, with the pdf taking the form of
(34)
Note that the MFSSSLN distribution contains the MFSSN distribution as a limiting case for and encloses the multivariate skew-slash distribution considered by Wang and Genton [19] as a reduced case when .
According to (18), the conditional distribution is given by
(35)
In addition, some algebraic manipulations yield
(36)
and(37)
To implement the ECME procedure for fitting MFSSSLN, the conditional expectations involved in (20) and (21) can be easily evaluated via the results of (17), (36), and (37). Besides, the DOF can be alternatively updated by maximizing the Q-function over , leading to the following CMQ-Step:
CMQ-Step 4:
(38)
3.3. The Multivariate Flexible Skew-Symmetric-Contaminated-Normal Distribution
The multivariate flexible skew-symmetric-contaminated-normal (MFSSCN) distribution, denoted by ∼, arises when , while has a discrete distribution taking value with probability and value 1 with probability . More precisely,
(39)
where is the proportion of outliers and is an inflation parameter denoting the degree of contamination.Using (14), the pdf of is given by
(40)
Obviously, the MFSSCN distribution reduces to the MFSSN distribution when , to the multivariate skew-contaminated-normal distribution [4] when , and is said to follows the “MFSSCNe" distribution if and are restricted to be equal, namely .
To obtain , we require the following conditional expectation
(41)
The resulting Q-function can be easily evaluated through (17) and (41) since . To estimate and , we perform the CML-Step, so the calculation of can be omitted.
3.4. The Multivariate Flexible Skew-Symmetric-t Distribution
The multivariate flexible skew-symmetric-t (MFSST) distribution, denoted by ∼, is created by setting with . Utilizing (8), the hierarchical representation for can be simplified as
(42)
where . Therefore, it can be verified that the MFSST distribution has the following pdf(43)
where denotes the cdf of the t distribution with DOF . The detailed proof of (43) is sketched in Appendix B. It is notable that the MFSST distribution includes the multivariate skew-t distribution of Azzalini and Capitanio [6] as a particular member by letting and the MFSSN distribution as a limiting case when grows to infity.Using (18) subject to , it suffices to show that
(44)
where .With arguments similar to those of Lin et al. [20], it is straightforward to derive
(45)
and(46)
whereAdditionally, using the law of iterated expectations, we can deduce that
(47)
As a consequence, the E-step in (20) and (21) requires the calculation of , , and , which can be directly evaluated via (45)–(47). Moreover, the procedure of updating is the same as (30).
3.5. The Multivariate Flexible Skew-Symmetric-t-t Distribution
Consider two independent random variables ∼ and ∼ with joint pdf given by
(48)
Using (14), we thus generate the multivariate flexible skew-symmetric-t-t (MFSSTT) distribution, denoted by ∼, whose pdf is of the form
(49)
When the two DOFs are restricted to be equal, namely , is said to follow the ‘MFSSTTe’ distribution. One thing worth noting is that the MFSSTT distribution reduces to the MFSSN distribution when , to the MFSSTN distribution by letting and , and embeds the multivariate skew-t-t (MSTT) distribution introduced by Wang et al. [21] as a special case under the setting of .
According to (18), it is easy to see ∼. The conditional pdf of given is
(50)
Further, we have the following conditional expectations:
(51)
and(52)
Applying the law of iterated expectations to (15) and (50) gives
(53)
With slight modifications as defined in (20) and (21), the necessary conditional expectations in the E-step include , , , , and , which can be easily evaluated via the results given in (29) and (51)–(53), respectively. To numerically estimate and for the MFSSTT distribution, we resort to the following two root-finding equations:
CMQ-Step 4: and are obtained by solving the roots of the following two equations:
(54)
and(55)
4. Simulation Studies
4.1. Recovery of the True Underlying Parameters
The first experiment intends to investigate the ability of the proposed ECME algorithm to recover the true underlying parameters. Monte Carlo samples of different sample sizes , 250, 500, and 1000 were generated from the MFSSN distributions specified in (7) and five examples of SSMFSSN distributions studied in Section 3. For ease of exposition, we considered the Hadamard power transformation of order three () that allows two shape parameters in the skewing function . Moreover, the flatness parameters for MFSSCN and MFSSTT were assumed to be equal, say , referred to as the MFSSCNe and MFSSTTe distributions. The presumed true parameters were , , , and . Furthermore, was taken for the MFSSTN, MFSSSLN, MFSST, and MFSSTTe distributions, while was adopted for the MFSSCNe distribution since its support lies within the interval . As an illustration, Figure 2 displays the scatter plots superimposed on the fitted contours for each type of data simulated from one trail.
For all scenarios, the accuracies of the parameter estimates are assessed by computing the mean absolute bias (MAB) and the root mean square error (RMSE) over replications. For a vector of parameters , these measures are, respectively, defined as
(56)
where denotes the ML estimate of the kth parameter at the rth replication and represents the actual value of .The experimental results are summarized in Table 1. It is readily seen both MAB and RMSE values tend to approach zero with increasing the sample size. While this study is limited to the simplest case , our developed ECME algorithm shows favorable ability to recover the true parameter values with data generated exactly according to model assumptions. Similar experiments have also been undertaken on more complex scenarios . The extensive results would not necessarily be excessively reported since the conclusions are in accordance with those already presented.
4.2. Comparing the Proposed Procedure with Convolution-Type EM Algorithms
The second experiment aims to compare the performance of the proposed selection-type ECME procedure outlined in Section 2.2 with the traditional EM-based algorithms derived based on convolution-type representations. As an illustration, we consider the fitting of MSN, MST, and MSCN distributions arisen from the multivariate skew-normal independent (SNI) family studied by Cabral et al. [22]. As discussed above, they are special cases of our proposed MFSSN, MFSST, and MFSSCN distributions by setting . Accordingly, the CMQ-Step 1 in Section 2 can be simplified as follows:
CMQ-Step 1: Fixing and , we obtain by
(57)
In total, 100 Monte Carlo (MC) samples of sizes , 500, and 1000 were generated from each of the three distributions. The true parameters were the same as those given in the previous experiment except for . Each simulated sample was fitted twice with the proposed selection-type ECME procedure and the EM-type algorithm based on convolution-type representations, as implemented by the
The box plots depicted in Figure 3 reveal the selection-type algorithm demands much lower computational cost than those required for the convolution-type algorithm. The phenomenon is more apparent for the MST and MSCN distributions, particularly for larger n. The high efficiency of the selection-type algorithm can be ascribed to the fact that its E-step is designed by virtue of simplification. Finally, it is worth mentioning that both algorithms can achieve the same final log-likelihood, as demonstrated by violin plots in Figure 4.
5. An Illustrative Example: The Wind Speed Data
We considered a trivariate dataset analyzed by Azzalini and Genton [24] for the study of spatial distribution of wind speed by means of the MST and various MSSMSN distributions proposed by Azzalini and Capitanio [6] and Arellano-Valle et al. [5], respectively. This dataset contains 278 hourly average speed assembled at three meteorological towers: Goodnoe Hills (gh), Kennewick (kw), and Vansycle (vs) from 23 February to 30 November 2003 recorded at midnight when wind speeds tend to peak. The positive and negative signs of wind speed measurements represent a westerly wind direction and an easterly wind direction, respectively. The Ljung–Box test indicates weak serial correlation for observations measured at the three stations. For modeling these data, we followed Azzalini and Genton [24] to treat the observations as being independent and identically distributed. Figure 5 presents histograms overlaid with kernel density curves obtained by using R
Six SSMFSSN models of order 3 were considered to fit the wind speed data. For the sake of comparison, the MST, MSTN, and MSTC distributions belonging to the MSSMSN family [5] were also fitted as sub-models of SSMFSSN subject to the constraint of . To select an appropriate model from the candidates, we adopted the Akaike information criterion (AIC) [25] and the Bayesian information criterion (BIC) [26], which are the two most widely used model selection indices based on penalized likelihood and applicable for both nested and non-nested models. The two criteria are defined as
(58)
where d is the number of free parameters in the model and is the maximized log-likelihood value. A lower AIC or BIC value indicates that a closer fit of the model to the data.Table 2 compares the ML estimation results for nine candidate models. As can be seen, our proposed SSMFSSN models perform favorably as compared to three MSSMSN analogs because they suffer from a lack of ability to capture the possibly bimodal behavior of the wind speed data (Figure 5). Accordingly, the MFSSTT distribution provides the best fit in terms of the lowest value of AIC as well as BIC, followed by the MFSST distribution. The MSTC and MST are the top two MSSMSN models with smaller AIC and BIC values.
Table 3 summarizes the resulting ML estimates of 6 considered SSMFFSN models together with their asymptotic standard errors obtained by performing the parametric bootstrap method [27]. Notably, the estimates of the shape and flat parameters indicate the presence of skewed and leptokurtic characteristics toward different directions among the three variables. As an illustration, the fitted 3D contour densities of MSTC, MST, MFSST, and MFSSTT distributions are depicted in Figure 6. It is interesting to see that the MFSSTT model having the smallest AIC and BIC can adapt the shape of the wind speed data more closely than the other three competitors.
6. Conclusions
We introduce a novel family of SSMFSSN distributions as a generalization of the work of Mahdavi et al. [10] that can capture simultaneously the dependency among multivariate responses, skewness, heavy-tailedness, and, in particular, multimodal density shapes without resorting to the use of finite mixtures [14,15,21]. Since the SSMFSSN model cannot be represented by a convolution-type form, this stimulates us to devise a feasible ECME algorithm for ML estimation under the selection-type mechanism. The effectiveness and efficiency of the algorithm are evaluated by conducting two simulation studies. Numerical analysis of a real dataset highlights the potential and capability of our proposed approach as a promising alternative tool for modeling multimodal multivariate data with asymmetrical behavior. Computer programs for implementation of our methods can be installed as a R package from Github devtools::install_github(“a-mahdavi/SSMFSSN.EM”). Further developments of the current approach could be exploited for powerful extensions of the factor analysis model or finite mixtures thereof with censored or possibly missing values that were considered recently by the authors of [28,29,30,31,32,33,34,35]. One limitation of the SSMFSSN model is that it may not be suited to the data with modes having too far distances. Another worthwhile extension of this work is to pursue a mixture modeling framework of the current approach that would be an effective way to resolve this problem.
Author Contributions
A.M. and T.-I.L. conceived the project, developed the statistical methods, designed the approach, and analyzed the results. All authors contributed to the development of the methodology and to writing the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
TIL was partially supported by the Ministry of Science and Technology of Taiwan (Grant No. MOST 109-2118-M-005-005-MY3).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors gratefully acknowledge three anonymous referees for their comments and suggestions that greatly improved this paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. The Hadamard Product
Below, we present three propositions concerning the multiplication and partial derivatives of the Hadamard product which are useful for our methodological developments.
Proposition A1.
Let be a matrix and be a diagonal matrix. Then, , where .
Proof.
Multiplication of a Hadamard product by diagonal matrix satisfies the following equation:
(A1)
By taking in (A1), we obtain
(A2)
Now, consider ; using (A2), we have
Similarly, for ,we have
Clearly, the desired statement can be established by mathematical induction on n. □
Proposition A2.
Let be a matrix and and be two column vector. Then,
Proof.
Let . It is easy to show
where “:” denotes Frobenius products defined as .Making use of the following facts
the differential of the scalar function f yieldsThis completes the proof. □
Proposition A3.
Let be a matrix and and be two column vector then,
Proof.
Similar to the proof of Proposition A2, we define and obtain its differential as below
which completes the proof. □Appendix B. Proof of Equation (43)
Proof.
Without loss of generality, we assume that and . Let , where ∼ and ∼ are two independent random variables and . Clearly, ∼. Therefore, it is easy to verify that ∼, ∼ and
By Bayes’ theorem, the pdf of is
This completes the proof. □
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures and Tables
Figure 1. Four scatter-contour plots coupled with their marginal histograms of the bivariate MFSSN distribution: (a) m=2, λ1=(0,2)⊤ and λ2=(0,−2)⊤ (bimodal); (b) m=2, λ1=(−2,2)⊤ and λ2=(1,−1)⊤ (trimodal); (c) m=3, λ1=(−2,2)⊤, λ2=(1,−1)⊤ and λ3=(2,−2)⊤ (trimodal); (d) m=3, λ1=(0,2)⊤, λ2=(0,−2)⊤ and λ3=(2,−1)⊤ (trimodal).
Figure 2. Scatter-contour plots of one simulation case with 500 random samples generated from six subfamilies of the proposed model.
Figure 3. Box plots of CPU time for convergence of selection-type and convolution-type algorithms for fitting MSN, MST, and MSCN distributions under various sample sizes.
Figure 4. Violin plots of converged log-likelihood obtained by selection and convolution EM-type algorithms for fitting MSN, MST, and MSCN distributions under various sample sizes.
Figure 5. Histograms of univariate measurements overlaid with kernel density curves for the hourly average wind speed collected at three meteorological towers.
Figure 6. The 3-D contour densities for MSTC, MST, MFSST, and MFSSTT distributions fitted to the wind speed data.
Simulation results based on 100 replications with different sample sizes.
Model | Parameter | n = 100 | n = 250 | n = 500 | n = 1000 | ||||
---|---|---|---|---|---|---|---|---|---|
MAB | RMSE | MAB | RMSE | MAB | RMSE | MAB | RMSE | ||
MFSSN | 0.090 | 0.122 | 0.057 | 0.074 | 0.037 | 0.048 | 0.027 | 0.033 | |
0.257 | 0.382 | 0.143 | 0.209 | 0.115 | 0.168 | 0.081 | 0.123 | ||
0.476 | 0.625 | 0.203 | 0.270 | 0.111 | 0.155 | 0.079 | 0.112 | ||
0.339 | 0.489 | 0.160 | 0.209 | 0.104 | 0.131 | 0.064 | 0.082 | ||
MFSSTN | 0.095 | 0.127 | 0.062 | 0.086 | 0.040 | 0.053 | 0.028 | 0.036 | |
0.419 | 0.673 | 0.244 | 0.383 | 0.158 | 0.241 | 0.121 | 0.179 | ||
0.382 | 0.469 | 0.349 | 0.426 | 0.317 | 0.382 | 0.306 | 0.355 | ||
0.335 | 0.439 | 0.194 | 0.244 | 0.177 | 0.216 | 0.151 | 0.181 | ||
2.495 | 10.745 | 0.458 | 0.651 | 0.319 | 0.413 | 0.203 | 0.258 | ||
MFSSSLN | 0.091 | 0.121 | 0.052 | 0.069 | 0.036 | 0.047 | 0.025 | 0.035 | |
0.424 | 0.641 | 0.306 | 0.489 | 0.190 | 0.314 | 0.127 | 0.200 | ||
0.458 | 0.588 | 0.267 | 0.347 | 0.216 | 0.269 | 0.174 | 0.206 | ||
0.470 | 0.614 | 0.259 | 0.378 | 0.174 | 0.227 | 0.114 | 0.138 | ||
6.530 | 11.849 | 3.425 | 8.206 | 1.191 | 3.251 | 0.481 | 0.721 | ||
MFSSCNe | 0.087 | 0.117 | 0.048 | 0.064 | 0.036 | 0.047 | 0.026 | 0.034 | |
0.363 | 0.546 | 0.285 | 0.440 | 0.223 | 0.339 | 0.133 | 0.202 | ||
0.446 | 0.595 | 0.288 | 0.353 | 0.202 | 0.247 | 0.138 | 0.174 | ||
0.426 | 0.586 | 0.265 | 0.347 | 0.178 | 0.219 | 0.131 | 0.160 | ||
0.199 | 0.216 | 0.162 | 0.176 | 0.116 | 0.137 | 0.083 | 0.098 | ||
MFSST | 0.114 | 0.152 | 0.069 | 0.091 | 0.042 | 0.055 | 0.032 | 0.040 | |
0.393 | 0.586 | 0.250 | 0.386 | 0.160 | 0.238 | 0.132 | 0.216 | ||
0.453 | 0.559 | 0.265 | 0.329 | 0.207 | 0.253 | 0.194 | 0.228 | ||
0.355 | 0.468 | 0.215 | 0.273 | 0.137 | 0.175 | 0.110 | 0.139 | ||
1.152 | 2.210 | 0.430 | 0.621 | 0.266 | 0.366 | 0.207 | 0.265 | ||
MFSSTTe | 0.114 | 0.146 | 0.070 | 0.094 | 0.047 | 0.061 | 0.029 | 0.038 | |
0.400 | 0.654 | 0.213 | 0.323 | 0.165 | 0.255 | 0.115 | 0.188 | ||
0.437 | 0.546 | 0.411 | 0.476 | 0.406 | 0.453 | 0.414 | 0.438 | ||
0.313 | 0.409 | 0.216 | 0.260 | 0.183 | 0.217 | 0.167 | 0.193 | ||
1.332 | 2.733 | 0.424 | 0.573 | 0.300 | 0.408 | 0.210 | 0.279 |
Summary results from fitting various models to the wind speed data. The model with the smallest value of AIC and BIC is displayed in bold.
Family | Model | d | AIC | BIC | |
---|---|---|---|---|---|
MSTC | –3178.7 | 13 | 6383.4 | 6430.5 | |
MSSMSN | MST | –3180.7 | 13 | 6387.5 | 6434.6 |
MSTN | –3180.9 | 13 | 6387.8 | 6434.9 | |
MFSSN | –3171.7 | 15 | 6373.4 | 6427.8 | |
MFSSTN | –3145.6 | 16 | 6323.1 | 6381.2 | |
SSMFSSN | MFSSSLN | –3147.1 | 16 | 6326.2 | 6384.2 |
MFSSCN | –3145.6 | 16 | 6323.3 | 6381.3 | |
MFSST | –3143.0 | 16 | 6318.1 | 6376.1 | |
MFSSTT | –3138.2 | 17 | 6310.4 | 6372.1 |
ML estimates of key parameters for six SSMFSSN models. The associated standard errors are shown in parentheses.
Parameter | MFSSN | MFSSTN | MFSSSLN | MFSSCN | MFSST | MFSSTT |
---|---|---|---|---|---|---|
23.0(0.05) | 21.3 (0.09) | 21.1 (0.04) | 21.5 (0.07) | 19.9 (0.06) | 18.6 (0.08) | |
14.8 (0.04) | 15.6 (0.08) | 15.2 (0.04) | 15.0 (0.07) | 15.1 (0.04) | 15.0 (0.06) | |
14.6 (0.04) | 14.9 (0.06) | 14.8 (0.02) | 15.4 (0.04) | 13.4 (0.08) | 12.6 (0.09) | |
221.2 (0.26) | 122.9 (0.49) | 80.6 (0.24) | 115.8 (0.31) | 115.6 (0.21) | 115.6 (0.25) | |
138.8 (0.16) | 96.4 (0.36) | 64.6 (0.18) | 91.5 (0.24) | 92.8 (0.15) | 93.2 (0.17) | |
151.0 (0.13) | 104.8 (0.26) | 69.5 (0.13) | 99.1 (0.16) | 102.9 (0.18) | 106.0 (0.19) | |
181.3 (0.10) | 134.5 (0.27) | 90.3 (0.16) | 128.0 (0.18) | 131.2 (0.18) | 132.1 (0.20) | |
111.4 (0.07) | 80.8 (0.18) | 54.4 (0.09) | 79.6 (0.12) | 78.8 (0.15) | 79.1 (0.15) | |
296.4 (0.07) | 203.4 (0.19) | 134.0 (0.13) | 187.2 (0.09) | 204.4 (0.24) | 206.7 (0.27) | |
–1.7 (0.01) | –0.4 (0.04) | –0.2 (0.01) | –0.3 (0.02) | 0.1 (0.02) | 1.0 (0.02) | |
1.6 (0.04) | 1.1 (0.04) | 1.0 (0.03) | 1.3 (0.04) | 1.3 (0.01) | 3.2 (0.01) | |
1.9 (0.04) | 1.4 (0.04) | 1.1 (0.02) | 1.3 (0.02) | 1.4 (0.01) | 2.8 (0.02) | |
–0.7 (1.99) | –0.1 (2.36) | 0.0 (1.03) | –0.1 (1.53) | –0.2 (0.04) | 1.2 (0.32) | |
–0.7 (0.05) | –0.4 (0.25) | –0.2 (0.05) | –0.4 (0.19) | –0.5 (0.01) | –1.6 (0.21) | |
–0.1 (1.52) | –0.2 (1.98) | –0.1 (0.78) | –0.2 (1.18) | –0.1 (0.02) | –2.3 (0.17) | |
– | 5.0 (0.18) | 1.5 (0.03) | 0.2 (0.04) | 4.7 (0.06) | 4.7 (0.08) | |
– | – | – | 0.2 (0.54) | – | 1.0 (0.16) |
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 by the authors.
Abstract
Multivariate skew-symmetric-normal (MSSN) distributions have been recognized as an appealing tool for modeling data with non-normal features such as asymmetry and heavy tails, rendering them suitable for applications in diverse areas. We introduce a richer class of MSSN distributions based on a scale-shape mixture of (multivariate) flexible skew-symmetric normal distributions, called the SSMFSSN distributions. This very general class of SSMFSSN distributions can capture various shapes of multimodality, skewness, and leptokurtic behavior in the data. We investigate some of its probabilistic characterizations and distributional properties which are useful for further methodological developments. An efficient EM-type algorithm designed under the selection mechanism is advocated to compute the maximum likelihood (ML) estimates of parameters. Simulation studies as well as applications to a real dataset are employed to illustrate the usefulness of the presented methods. Numerical results show the superiority of our proposed model in comparison to several existing competitors.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Department of Statistics, Faculty of Mathematics & Computer, Shahid Bahonar University of Kerman, Kerman 7616914111, Iran;
2 Institute of Statistics, National Chung Hsing University, Taichung 402, Taiwan; Department of Public Health, China Medical University, Taichung 404, Taiwan