1. Introduction
The beta (B) and Kumaraswamy (K) distributions [1,2] play an important role in the analysis of bounded data such as rates, proportions, and percentages. These distributions, presenting only two shape parameters, have a very flexible probability density function (pdf), presenting monotonic, unimodal, and reverse-unimodal shapes. Structurally, the analytical expressions of the pdf’s of these distributions are similar. The random variable X follows the B distribution with shape parameters , denoted as , if its pdf is given by
where is the beta function. On the other hand, if X has the K distribution with shape parameters , denoted as , its pdf is given byBeing rigorous, it should be mentioned that the support of the previous pdf’s is the closed interval , but the extremes 0 and 1 tend to be omitted to avoid difficulties in obtaining the maximum likelihood (ML) estimators for and . When the minimum observation in the data is exactly the value 0 or the maximum observation is exactly the value 1, the likelihood is null or indeterminate depending on whether and are greater than or less than 1. Consequently, the ML estimates of and cannot be obtained.
Note that the presence of the value 0 or 1 in the data will depend on the nature of the phenomenon being studied. For example, if the monthly proportion of the shared household budget allocated to some type of expense (food, clothing, transportation, etc.) is studied, it is to be expected that some observations will be exactly 0, since some households may not buy clothing in a given month, see for example Blundell et al. [3]. In a scenario like this, the following two paths are frequently chosen to overcome this difficulty:
1. Use another estimation method that provides consistent and efficient estimators, the moment method is a good alternative in this case. Details about the moment estimators for the shape parameters of the B and K distributions can be found in Johnson et al. [1] and Dey et al. [4].
2. Use the four-parameter version of the B distribution with two location parameters. This version is defined by the transformation , where and are the location parameters. Thus, Y follows the B distribution over the interval . Similarly, the above is valid for defining the four-parameter version of the K distribution. When a and b are known, or specified under some criteria by the analyst, ML estimates of the shape parameters of the B and K distributions can be obtained following the standard procedures of the ML method. When a and b are unknown, obtaining the estimates can be problematic because the regularity conditions are not satisfied, the support of the distributions depends on the location parameters. See Wang [5] and Smith [6] for some details on classes of non-regular distributions in ML estimation.
In this article, we propose a new two-parameter distribution for fitting bounded data to the unit interval. The pdf of the new distribution, depending on its parameters, can present monotonic, unimodal and reverse-unimodal shapes. We note that the pdf tends to a positive finite value at the lower end of its support, consequently, the likelihood is not null or indeterminate when trying to fit a data set whose minimum observation is exactly 0. Thus, the ML estimates for its parameters can be obtained.
The proposed distribution arises from the Lambert-F distribution generator [7], defined by the cumulative distribution function (cdf)
(1)
where e is the Euler’s number, is an arbitrary continuous cdf with parameter vector , is the corresponding quantile function (qf), and denotes the principal branch of the Lambert W function. See Corless et al. [8] and Brito et al. [9] for details of the Lambert W function.Under a uniform (U) baseline distribution, Iriarte et al. [10] propose the Lambert-uniform (LU) distribution defined by the pdf , , . This pdf has a monotonic shape, increasing or decreasing depending on , and converges to finite values at the ends of its support. Iriarte et al. [10] show that the LU distribution can perform better than the B and K distributions when the histogram of the data exhibits increasing or decreasing behavior.
In our case, in Equation (1), we consider a proportional hazard uniform (PHU) baseline distribution [11], defined by the cdf , , . This distribution is characterized by having a hazard rate function (hrf) proportional to the hrf of the U distribution and, furthermore, by presenting a monotonic pdf. In this way, we obtain a new two-parameter distribution whose pdf tends to a positive finite value at the lower end of its support (similar to the LU pdf), which in addition to exhibiting monotonic shapes is also capable of presenting unimodal and reverse-unimodal shapes.
The article is organized as follows. In Section 2, we propose the new distribution and derive its main structural properties as pdf, cdf, and hrf. In addition, we derive the raw moments that are used to describe the behavior of the skewness and kurtosis of the distribution. In Section 3, we address the problem of parameter estimation via the ML method. We developed a simulation study to evaluate the behavior of the estimators. In Section 4, we present two application examples with real data in order to illustrate the usefulness of the new distribution. The concluding remarks are considered in Section 5.
2. The New Distribution
In this section, we propose the new distribution, derive some of its main structural properties, and describe the behavior of skewness and kurtosis.
2.1. LPHU Random Variable
In what follows, we define the LPHU random variable and derive some of its main properties.
A random variable X follows the Lambert proportional hazard uniform distribution with shape parameter and , denoted as , if it can be represented as
(2)
where e is the Euler’s number, is the principal branch of the Lambert W function, and U is a uniform random variable.Considering the change of variable , it can be directly verified that and . Thus, taking into account that is a monotonic function, we observe that X is a one-to-one transformation of U that maps values from the interval to the interval . In consequence, the distribution of X inherits the support of the distribution of U, but it will present a greater variety of shapes due to the extra parameter.
Let . Then, the cdf of X is given by
(3)
From Equation (2), for , we have that
Then, by definition of the Lambert W function, it follows that and the result is obtained taking into account that , since U has uniform distribution. Finally, note that the expression obtained is also valid for , once is the PHU cdf. □The pdf of X can be obtained in a straightforward way from Proposition 1.
Let . Then, the pdf of X is given by
(4)
Note that the functions given in Equations (2)–(4) are closed, so they are easy to implement computationally. We use the lamW package [12] of the R programming language [13] for the computation of the principal branch of the Lambert W function. Regarding the shapes of the LPHU pdf, we observe that:
The LPHU pdf is not null at the lower end of its support, . Thus, the LPHU has a behavior similar to that of the LU and PHU pdf’s, but with the advantage that it can present unimodal and reverse-unimodal shapes;
Equation (4) reduces to the PHU, LU, and uniform pdf’s when , and , respectively. Thus, for such parameter choices, the LPHU pdf inherits the shapes of the PHU, LU, and U pdf’s;
For and , we observe that the equation leads to the statement that the LPHU pdf may have a critical point at
where is a maximum or a minimum if or , respectively, such that , where .
The upper left panel of Figure 1 shows some LPHU pdf curves for different choices of and . In the figure, it can be seen that the LPHU pdf can present monotonic, unimodal, and reverse-unimodal shapes. In the same figure (upper right panel and lower panels) the histograms of three sets of LPHU pseudo-random numbers are presented together with the B and K pdf’s equipped with the ML estimates. Pseudo-random numbers were generated from Equation (2) by considering a uniform random input. In the figure, it can be seen that the B and K pdf’s deviate from the relative frequencies exhibited by the histograms, especially in the lower quantiles, since the B and K pdf’s tend to ∞ or 0 at the lower end of the support. This suggests that the LPHU distribution could have a better performance than the B and K distributions when fitting real data that present a histogram with a behavior similar to that shown in this figure.
R codes for the computation of Equations (3) and (4) and for the generation of pseudo-random numbers from the LPHU distribution are provided in Appendix A.
2.2. Related Distributions
By choosing suitable values for the shapes parameters of the LPHU distribution it is possible to distinguish the following special cases: 1. If , the LPHU distribution reduces to the PHU distribution; 2. If , the LPHU distribution reduces to the LU distribution; 3. If , the LPHU distribution reduces to the U distribution.
It is well known that some distributions such as the exponential, Rayleigh, and power, among others, can be derived as a transformation of a U random variable. Considering these transformations on a LU random variable, we derive the following distributions:
Let , where and . Then, Y follows the nonscaled Lambert-exponential distribution. See Iriarte et al. [7];
Let , where and . Then, Y follows the Lambert–Rayleigh distribution. See Iriarte et al. [7];
Let , where and . Then, the distribution of Y is a three-parameter distribution that reduces to the K distribution when . In this case, the cdf of Y is given by , where . Thus, we refer to this distribution as the Lambert–Kumaraswamy distribution.
Other distributions of the literature can be derived under consideration of appropriate transformations of LPHU random variables. Illustratively, we consider in this section only the three transformations described above. As a final consideration of this section, we highlight that the linear transformation , where , with , follows a LPHU distribution on the continuous range . Therefore, the LPHU distribution can be easily used to fit bounded data to any real range.
2.3. Hazard Rate Function
The reliability function (rf) and the hazard rate function (hrf) play an important role in the analysis of lifetime data in reliability studies. In the following statement the rf and hrf of the LPHU distribution are derived.
Let . Then, the rf and the hrf of T are given by
(5)
(6)
If T is a random variable representing the failure time of mechanical units, the rf of T, defined as , , indicates the probability that mechanical units survive beyond the time t. If , the result in (5) is obtained as , where is as in Equation (3)
On the other hand, the hrf of T, defined as (where is the pdf of T), measures the propensity of a mechanical units to fail or die depending on the age it has reached. If , the result in (6) is a consequence of Equations (3) and (4). □
In Proposition 2, it can be seen that the LPHU hrf corresponds to a modification in a multiplicative fashion of the PHU hrf. Furthermore, we observe that
which means that the LPHU hft can be understood as a modification in early times of the baseline PHU hrf. Figure 2 shows some LPHU hrf curves considering different choices for and . In the figure, it can be seen that (like the B and K hrf’s) the LPHU hrf can present monotonic and reverse-unimodal shapes.2.4. Skewness and Kurtosis Behavior
In this section, we describe the skewness and kurtosis behavior of the LPHU distribution by analyzing Fisher’s skewness and kurtosis coefficients. For this, we first derive the raw moments.
Let . Then, for , the rth raw moment of X is given by
where .
From Equation (4), considering the change of variable , we obtain that . Thus, the binomial theorem leads to the expression
and the result is obtained by representing the previous integral as and noting that . □Note that the function in Proposition 3 must be calculated by numerical integration. A good alternative is to use the integrate function of the R language.
Let . Then, the mean and variance of X are and , where , with , is as in Proposition 3.
Let . Then, the Fisher’s skewness and kurtosis coefficients of X are given by
where , with , is as in Proposition 3.
Due to the analytical complexity it is not possible to obtain closed expressions for the critical points of the Fisher’s skewness and kurtosis coefficients. However, by maximizing and minimizing the coefficients with the help of the R programming language, we obtain approximate ranges of skewness and kurtosis, thus obtaining and .
Figure 3 shows the behavior of Fisher’s skewness and kurtosis coefficients for the LPHU distribution. In the figure, it can be seen that the skewness coefficient behaves monotonically with respect to both parameters, a fact that can be understood as an identifiability indicator in the sense that different values of the parameters lead to different skewness levels and, consequently, to different members of the LPHU family. On the other hand, we observe that the kurtosis coefficient can have a non-monotonic behavior, so that there can be two values of (or ) associated with the same kurtosis level. This can be understood as the kurtosis levels associated with the weight of the left tail when the distribution is negatively skewed and with the weight of the right tail when it is positively skewed.
3. Parameter Estimation
In this section, we discuss the parameter estimation for the LPHU distribution via the maximum likelihood (ML) method and develop a simulation study to evaluate the behavior of the estimators.
3.1. ML Estimation
Given an observed sample from the random variable , the log-likelihood function is given by
(7)
where and .The ML estimators and of and can be obtained by taking the partial derivatives of Equation (7) and solving the corresponding system of equations, which is given by the equations
(8)
(9)
The asymptotic distribution (under regularity conditions) of the ML estimator of is , where is the expected information matrix. Taking into account the structure of Equation (7), we observe that it is not easy to derive the analytical expression of this matrix. Thus, we consider an approximation from the observed information matrix, where the elements of this matrix are computed as minus the second partial derivatives of the log-likelihood function with respect to each parameter (assessed in the ML estimates).
The observed information matrix is given by
whereThen, approximate % confidence intervals for and can be determined by and , respectively, where is the upper th percentile of the standard normal distribution, and and are the diagonal elements of the matrix (assessed in the ML estimates).
3.2. Computational Guidelines
From Equations (8) and (9), it can be seen that the ML estimators cannot be obtained explicitly, so the estimates must be obtained using numerical procedures. A good alternative for this is to use the multiroot function [14] of the R programming language, which implements the Newton–Raphson method for obtaining roots of systems of nonlinear equations.
Due to the above, we consider the following points:
Alternatively, the ML estimates can be obtained by solving the optimization problem , subject to , , where is given in Equation (7).
For this, we use the optim function of the R programming language. An R code is provided in Appendix B.
Specifically, we consider the L-BGSB-B algorithm [15], which allows us to specify the parameter space. This algorithm requires declaring a value in the parameter space to initialize the iterative process. Taking into account that the PHU distribution is a special case of the LPHU distribution, we consider and , where is the ML estimator of the shape parameter of the PHU distribution.
3.3. Simulation Study
We generate 1000 random samples from the LPHU distribution under the sample sizes , 20, 30, …, 1500, respectively, and consider the following scenarios: Scenario A, where and , being the pseudo-random numbers generated from a unimodal LPHU distribution; Scenario B, where and , being the pseudo-random numbers generated from a LPHU distribution with reverse-unimodal pdf; Scenario C, where and , being th pseudo-random numbers generated from a LPHU distribution with increasing pdf.
Pseudo-random numbers were generated from Equation (2) considering the following steps:
Generate ;
Compute .
For each simulated sample, we obtain the ML estimate following the guidelines of Section 3.1. The R code used in scenario A is provided in Appendix C. For scenarios B and C it is enough to modify the initial parameters.
Figure 4, Figure 5 and Figure 6 illustrate the behavior of the average estimate (AE), the standard deviation (SD), the square root of the simulated mean square error (RMSE), the average of the asymptotic standard error (SE), and the probability of coverage of the 95% asymptotic confidence interval (CP) for each set of 1000 estimates obtained under the different sample sizes and scenarios considered. Looking at the figures, we can see that the AE’s tend to be close to the true values of the parameters as the sample size increases. An overestimation of the parameters is observed when the estimates are obtained from small samples (), which decreases rapidly as the sample size increases. The SD’s, RMSE’s, and SE’s are close and decrease towards 0 as the sample size increases, as expected in the standard asymptotic theory. Similarly, the CP’s converge to the nominal values used to construct the confidence intervals as the sample size increases.
4. Data Analysis
In this section, we present two applications of the LPHU distribution in which its performance in fitting real data is contrasted with that of the B and K distributions.
4.1. Firm’s Risk Management Cost Effectiveness
We consider 73 observations on the measure of the firm’s risk management cost effectiveness presented by Schmit and Roth [16]. Some descriptive statistics of the data are as follows: Minimum, 0.0020; Maximum, 0.9755; Fisher’s skewness coefficient, 3.7154; Fisher’s kurtosis coefficient, 17.957.
Table 1 reports the ML estimates, the values associated with the Akaike Information Criterion (AIC) [17], and the Bayesian Information Criterion (BIC) [18], and the p-values of the traditional Anderson–Darling (AD) and Cramer–von Mises (CvM) goodness of fit tests [19] for the LPHU, B, and K distributions fitted to the firm’s risk management cost effectiveness data. Looking at the table, under the 0.05 significance level, we note that the p-values indicate that only the LPHU distribution appropriately models the firm’s risk management cost effectiveness data. In addition, it can be seen that the LPHU distribution is the one with the lowest AIC and BIC values among the fitted distributions, suggesting that this distribution should be selected for modeling the firm’s risk management cost effectiveness data.
We observe that the LPHU pdf, equipped with the ML estimates provided in Table 1, takes the value 2.846 at the lower end of its support, allowing for more accurate modeling of the lower empirical quantiles. Figure 7 presents the empirical distribution contrasted with the fitted LPHU, B, and K distributions. In the Figure, it can be seen that the lower empirical quantiles are closer to the LPHU distribution quantiles.
4.2. Household Shared Budget for Transportation
Studies on the shared household budget are highly valued when trying to find out the evolution of the price level faced by households in a given locality, see for example Blundell et al. [3]. The BudgetUk database [20] of the R language provides information on the proportion of the shared budget that British households spend on different items such as food, clothing, transport, fuel, and alcohol.
In this application, we compare the performance of the LPHU, B, and K distributions by modeling the proportion of the shared budget of households that allocate part of the budget to transportation. These data correspond to 1473 observations bounded to the interval whose histogram exhibits a unimodal behavior. From now on, we will refer to these data simply as the household shared budget proportion data.
For comparison, we use the sample function of the R language to obtain 1000 random samples of size 100 from the household shared budget proportion data. Based on the modified AD and CvM goodness-of-fit tests [21], under the 0.05 significance level, we calculate the proportion of samples where the LPHU, B, and K distributions fit the data appropriately. We call this the non-rejection rate. Additionally, we calculate the proportion of samples where each distribution presents the lowest AIC and BIC values, that is, the proportion of samples where each distribution exhibits the best performance. We call this the hit rate. Table 2 reports the values associated with the non-rejection and hit rates for the LPHU, B, and K distributions fitted to the 1000 samples. In the table, we observe that the LPHU distribution is able to adequately fit a larger proportion of samples than the B and K distributions. In addition, we see that the performance of the LPHU distribution is better in a large proportion of samples, in 83.6% of the samples.
Table 3 shows the parameter estimates and the AIC, BIC, and values for the LPHU, B, and K distributions fitted to a single sample obtained from the household shared budget proportion data. Looking at the table, it can be seen that the LPHU distribution performs more accurately than the B and K distributions. We note that the fitted LPHU pdf takes the value 2.846 at the lower end of its support. This allows the LPHU distribution to more appropriately fit the lower empirical quantiles than the B and K distributions. This is illustrated in the upper left panel of Figure 8. In the upper right panel and the lower panels of Figure 8, the qq-plots for the LPHU, B and K distributions are presented, where it can be seen that the LPHU distribution presents a better fit of the lower quantiles.
5. Concluding Remarks
We propose a new two-parameter distribution for the fit of bounded data. The pdf of the new distribution, called the LPHU distribution, presents shapes similar to those exhibited by the popular B and K distributions, but with the characteristic that it tends towards a positive finite value at the lower end of its support. The latter may lead to a better fit of the lower empirical quantiles of certain datasets. Structural properties such as pdf, cdf, and hrf have a closed analytical structure, so they are easy to process computationally. From the description of Fisher’s skewness and kurtosis coefficients for the LPHU distribution, it is concluded that the LPHU distribution can present both positive and negative skewness, and is also capable of capturing high levels of kurtosis. Regarding the parameter estimation, the ML estimators for the LPHU distribution do not have a closed form, so it is necessary to use numerical procedures to obtain the estimates. We use the optim function of the R programming language to accomplish this task. Simulation studies show that the ML method provides acceptable estimates for the parameters of the LPHU distribution. Finally, two application examples illustrate that the LPHU distribution can perform better when fitting real data than the popular B and K distributions.
Conceptualization, Y.A.I. and M.A.R.; methodology, J.R. and H.V.; software, Y.A.I.; validation, Y.A.I., M.A.R., J.R. and H.V.; formal analysis, Y.A.I., M.A.R., J.R. and H.V.; investigation, Y.A.I.; supervision, Y.A.I. All authors contributed significantly to this research article. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
Not applicable.
The authors declare no conflict of interest.
The following abbreviations are used in this manuscript:
B | beta distribution |
K | Kumaraswamy distribution |
probability density function | |
ML | maximum likelihood |
cdf | cumulative distribution function |
U | uniform distribution |
LU | Lambert uniform distribution |
PHU | proportional hazard uniform distribution |
rf | reliability function |
hrf | hazard rate function |
LPHU | Lambert proportional hazard uniform distribution |
AE | average estimate |
SD | standard deviation |
RMSE | root mean square error |
SE | standard error |
CP | coverage probability |
AD | Anderson–Darling |
CvM | Cramer–von Mises |
AIC | Akaike information criterion |
BIC | Bayesian information criterion |
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Top left: Some LPHU pdf curves considering different values for [Forumla omitted. See PDF.] and [Forumla omitted. See PDF.]. Top right and bottom: Histograms for three sets of 1000 pseudorandom numbers from the LPHU distribution, with [Forumla omitted. See PDF.] = (1.7, 2), (0.5, 1.5), and (0.2, 0.5), respectively, fitted with the B and K distributions via the ML method.
Figure 2. Some LPHU hrf curves for [Forumla omitted. See PDF.] and different values of [Forumla omitted. See PDF.].
Figure 4. The AE, SD, RMSE, SE, and CP for each of the 1000 estimates of [Forumla omitted. See PDF.] (top) and [Forumla omitted. See PDF.] (bottom) obtained in scenario A, under the different sample sizes.
Figure 5. The AE, SD, RMSE, SE, and CP for each of the 1000 estimates of [Forumla omitted. See PDF.] (top) and [Forumla omitted. See PDF.] (bottom) obtained in scenario B, under the different sample sizes.
Figure 6. The AE, SD, RMSE, SE, and CP for each of the 1000 estimates of [Forumla omitted. See PDF.] (top) and [Forumla omitted. See PDF.] (bottom) obtained in scenario C, under the different sample sizes.
Figure 7. (Left): Histogram for the firm’s risk management cost effectiveness data and the fitted pdf curves via the ML method. (Right): Empirical cdf for the firm’s risk management cost effectiveness data and the fitted cdf curves.
Figure 8. Top left: Histogram for a single sample obtained from the household shared budget proportion data and the fitted pdf curves via the ML method. Top right and lower: QQ-plots for the LPHU (red), B (green), and K (blue) distributions.
The parameter estimates with standard errors in parentheses, the AIC and BIC values, and the p-values of the AD and CvM goodness-of-fit tests for each distribution fitted to the firm’s risk management cost effectiveness data.
Distribution |
|
|
AIC | BIC | AD | CvM |
---|---|---|---|---|---|---|
LPHU | 0.012 | 1.889 | −174.3 | −169.7 | 0.484 | 0.616 |
(0.016) | (0.496) | |||||
K | 0.664 | 3.440 | −153.3 | −148.7 | 0.025 | 0.039 |
(0.071) | (0.620) | |||||
B | 0.612 | 3.797 | −148.2 | −143.7 | 0.009 | 0.013 |
(0.085) | (0.715) |
Non-rejection rates based on modified AD (
Non-Rejection Rate | Hit Rate | |||
---|---|---|---|---|
Distribution |
|
|
AIC | BIC |
LPHU | 0.734 | 0.718 | 0.836 | 0.836 |
B | 0.502 | 0.479 | 0.130 | 0.130 |
K | 0.562 | 0.531 | 0.034 | 0.034 |
ML estimates (with SE in parentheses), AIC and BIC values, and modified AD (
Distribution |
|
|
AIC | BIC |
|
|
---|---|---|---|---|---|---|
LPHU | 1.996 | 9.222 | −209.5 | −204.3 | 0.112 | 0.715 |
(0.259) | (0.933) | |||||
B | 1.302 | 8.217 | −203.1 | −197.8 | 0.213 | 1.308 |
(0.166) | (1.228) | |||||
K | 1.273 | 10.539 | −105.1 | −199.9 | 0.177 | 1.092 |
(0.116) | (2.385) |
Appendix A. R Codes
R codes for computing the cdf and pdf and for obtaining pseudo-random numbers from the LPHU distribution.
Appendix B. R Code
R code to obtain ML estimates for the parameters of the LPHU distribution.
Appendix C. R Code
R code used in the simulation study of
References
1. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; John Wiley & Sons: Hoboken, NJ, USA, 1995; Volume 2.
2. Kumaraswamy, P. A generalized probability density function for double-bounded random processes. J. Hydrol.; 1980; 46, pp. 79-88. [DOI: https://dx.doi.org/10.1016/0022-1694(80)90036-0]
3. Blundell, R.; Duncan, A.; Pendakur, K. Semiparametric estimation and consumer demand. J. Appl. Econom.; 1998; 13, pp. 435-461. [DOI: https://dx.doi.org/10.1002/(SICI)1099-1255(1998090)13:5<435::AID-JAE506>3.0.CO;2-K]
4. Dey, S.; Mazucheli, J.; Nadarajah, S. Kumaraswamy distribution: Different methods of estimation. Comput. Appl. Math.; 2018; 37, pp. 2094-2111. [DOI: https://dx.doi.org/10.1007/s40314-017-0441-1]
5. Wang, J.Z. A note on estimation in the four-parameter beta distribution. Commun.-Stat.-Simul. Comput.; 2005; 34, pp. 495-501. [DOI: https://dx.doi.org/10.1081/SAC-200068514]
6. Smith, R.L. Maximum likelihood estimation in a class of nonregular cases. Biometrika; 1985; 72, pp. 67-90. [DOI: https://dx.doi.org/10.1093/biomet/72.1.67]
7. Iriarte, Y.A.; de Castro, M.; Gómez, H.W. The Lambert-F distributions class: An alternative family for positive data analysis. Mathematics; 2020; 8, 1398. [DOI: https://dx.doi.org/10.3390/math8091398]
8. Corless, R.M.; Gonnet, G.H.; Hare, D.E.; Jeffrey, D.J.; Knuth, D.E. On the LambertW function. Adv. Comput. Math.; 1996; 5, pp. 329-359. [DOI: https://dx.doi.org/10.1007/BF02124750]
9. Brito, P.; Fabiao, F.; Staubyn, A. Euler, Lambert, and the Lambert W function today. Math. Sci.; 2008; 33.
10. Iriarte, Y.A.; de Castro, M.; Gómez, H.W. An alternative one-parameter distribution for bounded data modeling generated from the Lambert transformation. Symmetry; 2021; 13, 1190. [DOI: https://dx.doi.org/10.3390/sym13071190]
11. Martínez-Florez, G.; Moreno-Arenas, G.; Vergara-Cardozo, S. Properties and inference for proportional hazard models. Rev. Colomb. Estad.; 2013; 36, pp. 95-114.
12. Adler, A. lamW: Lambert-W Function. 2015. R Package Version 2.1.1. Available online: https://doi.org/10.5281/zenodo.5874874 (accessed on 5 January 2023).
13. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021.
14. Soetaert, K. rootSolve: Nonlinear Root Finding, Equilibrium and Steady-State Analysis of Ordinary Differential Equations. 2009; R Package Version 1.6 Available online: https://CRAN.R-project.org/package=rootSolve (accessed on 5 January 2023).
15. Byrd, R.H.; Lu, P.; Nocedal, J.; Zhu, C. A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput.; 1995; 16, pp. 1190-1208. [DOI: https://dx.doi.org/10.1137/0916069]
16. Schmit, J.T.; Roth, K. Cost effectiveness of risk management practices. J. Risk Insur.; 1990; 57, pp. 455-470. [DOI: https://dx.doi.org/10.2307/252842]
17. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control.; 1974; 19, pp. 716-723. [DOI: https://dx.doi.org/10.1109/TAC.1974.1100705]
18. Schwarz, G. Estimating the dimension of a model. Ann. Stat.; 1978; 6, pp. 461-464. [DOI: https://dx.doi.org/10.1214/aos/1176344136]
19. Faraway, J.; Marsaglia, G.; Marsaglia, J.; Baddeley, A. Goftest: Classical Goodness-of-Fit Tests for Univariate Distributions. R Package Version 1.2-3. 2021; Available online: https://CRAN.R-project.org/package=goftest (accessed on 5 January 2023).
20. Croissant, Y.; Graves, S. Ecdat: Data Sets for Econometrics. R Package Version 0.4-2. 2022; Available online: https://CRAN.R-project.org/package=Ecdat (accessed on 5 January 2023).
21. Chen, G.; Balakrishnan, N. A general purpose approximate goodness-of-fit test. J. Qual. Technol.; 1995; 27, pp. 154-161. [DOI: https://dx.doi.org/10.1080/00224065.1995.11979578]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
In this article, we propose a new two-parameter distribution for bounded data such as rates, proportions, or percentages. The density function of the proposed distribution, presenting monotonic, unimodal, and inverse-unimodal shapes, tends to a positive finite value at the lower end of its support, which can lead to a better fit of the lower empirical quantiles. We derive some of the main structural properties of the new distribution. We make a description of the skewness and kurtosis of the distribution. We discuss the parameter estimation under the maximum likelihood method. We developed a simulation study to evaluate the behavior of the estimators. Finally, we present two applications to real data providing evidence that the proposed distribution can perform better than the popular beta and Kumaraswamy distributions.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer