This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
In practice, collecting all of the information on an object under investigation is challenging; therefore, predictions and decision-making studies are based on samples. Using probability theory, sampling is an art form for determining the dependability of available data. Simple random sampling (SRS) is the most common and simplest approach for selecting samples with equal probability at each selection while avoiding the concentration of auxiliary information. We collect some additional information (X) that is positively or negatively connected to the variable of interest (Y) in real-life situations with the variable of interest (Y). If we incorporate new information into classical estimators, we will get flexible results. Many researchers are presently striving to increase the flexibility of existing estimators by incorporating additional data. For example, Kadilar and Cingi [1] worked on the regression type estimators, Yan and Tian [2], Ijaz et al. [3–5].
The usual estimator of the population mean is defined by
The bias and mean square error of
Kadilar and Cingi [2006] introduced a classical ratio and regression estimator.
The bias of
The mean square error of
Yan and Tian [2010] suggested the efficient ratio-type estimators
The mean square error of
Ijaz et al. [3] proposed ratio and regression type estimators
The mean square error is, respectively, given by
Other estimators of Ijaz et al. [4, 5] are defined by
The mean square error of proposed ratio type estimators is
The Jeelani et al. [2013] recommended ratio estimator is as follows:
The mean square error of the above estimator is defined by
2. Research Problem
In actual, some data sets have a broad range of values known as outliers. The classical estimators will result in an incorrect conclusion and overfitting of the model in such a case. The primary goal of the current work is to create an estimate that will not be significantly impacted by an outlier. This paper used the midrange and interdecile range to investigate novel robust type ratio type estimators.
3. Methodology of the Proposed Estimators
The study is motivated by Kadilar and Cingi [1] where the authors proposed some regression type estimators. The study of Kadilar and Cingi [1] was not taken into account the data sets with an outlier. The current study focused to cover this gap and developed some robust type estimators that are not much effective against outliers. This paper presents new estimators for estimating the population means using the auxiliary information in the forms of midrange (MR) and interdecile range (IDR). The proposed estimators are defined by
To derive the estimator bias, and mean square error, we consider
applying expectations on both sides, we get the bias of
Squaring and applying expectations on both sides of equation (26), we get
The mean square error of
3.1. Theoretical Conditions
In this section, theoretical conditions are derived so that to assess the performance of the proposed estimators as compared to the existing estimators. The MSE of the proposed estimators is given in equation (29) with the usual mean estimator given in equation (2) can be compared in the following way.
Similarly, the Mse of the proposed estimator given in equation (29) can be compared with that of the Mse given in equation (29), we have the following.
The proposed estimator leads to a better performance as compared to others iff the above conditions are satisfied. Table 1 defines the result of theoretical conditions using population data sets 1 and 2.
Table 1
Numerical values of theoretical conditions using data sets 1 and 2.
Estimators | Population I | Population II |
−10264.81 | −18618.73 | |
−11445.01 | −19321.21 | |
−12782.44 | −22537.39 | |
−12723.93 | −23096.04 | |
−12786.60 | −23034.91 | |
−6363.179 | −18188.94 | |
−7543.376 | −18891.41 | |
−8880.813 | −22107.6 | |
−8822.300 | −22666.22 | |
−8885.067 | −22605.4 |
The results of Table 1 clearly demonstrate that the aforementioned theoretical requirements are met for both data sets; hence, it is anticipated that the suggested estimators will perform better for these two data sets than for others.
3.2. Applications
The paper proposed the robust type estimators, and hence, we considered two data sets with outliers. The data sets were obtained from the Italian Bureau of the Environment Protection [7] and recently cited by Abid et al. [8]. The data statistics are given in Tables 2 and 3.
Table 2
Data statistics.
Table 3
Data statistics.
The percentage relative efficiency (PRE) is shown in Tables 4 and 5computed with the following mathematical formula:
Table 4
Bias, MSE, and PRE of the proposed and other estimators using Population I.
Estimators | Bias | MSE | PRE |
0 | 127.6071 | 100 | |
1.147847 | 131.5218 | 97.0235 | |
1.148111 | 131.5384 | 97.0113 | |
1.144672 | 131.3230 | 97.1704 | |
1.150690 | 131.6999 | 96.89232 | |
1.055885 | 125.7631 | 101.4662 | |
0.02962096 | 60.57783 | 210.6498 | |
0.15196360 | 66.64281 | 191.4792 | |
1.02040400 | 116.3789 | 109.6479 | |
0.70222700 | 97.74705 | 130.5483 | |
0.8740394 | 107.7786 | 118.3975 |
Table 5
Bias, MSE, and PRE of the proposed and other estimators using Population II.
Estimators | Bias | MSE | PRE |
0 | 349.833 | 100 | |
4.040599 | 529.0587 | 66.12366 | |
4.043777 | 529.4648 | 66.07294 | |
4.034300 | 528.2538 | 66.22442 | |
4.113499 | 538.3750 | 64.97942 | |
2.974909 | 392.8679 | 89.04595 | |
−0.463064 | 92.69699 | 377.3941 | |
−0.4732379 | 76.95468 | 454.5961 | |
1.2816110 | 64.48238 | 542.5249 | |
0.7250457 | 33.81653 | 1034.503 | |
0.8101430 | 37.97263 | 921.2766 |
Tables 4 and 5 define the Bias, Mse, and PRE of the proposed and other existing estimators. The results concluded that the proposed estimators have a fewer Mse and high PRE as compared to others and thus lead to a preferable fit.
4. Discussion and Conclusion
The traditional or common estimators are consistently inadequate for estimating the population parameter in practice. The traditional estimators overestimate the sets of data that contain an outlier(s). The current work examines various innovative estimators that are less susceptible to these outliers as a result. In this paper, novel regression-ratio type estimators are proposed by using the midrange (MR) and interdecile range (IDR). To evaluate the model performance, we derived theoretical conditions and used real-world data sets to back up our findings. Furthermore, the proposed as well as alternative estimators’ results for the bias, mean square error, and percentage relative efficiency (PRF) are computed. The proposed estimators are superior to others in terms of PRE, but their Mse is the lowest of all. It is obvious that the suggested estimators outperform other methods in terms of results
[1] C. Kadilar, H. Cingi, "An improvement in estimating the population mean by using the correlation coefficient," Hacettepe Journal of Mathematics and Statistics, vol. 35, 2006.
[2] Z. Yan, B. Tian, "International Conference on Information Computing and Applications," Proceedings of the 2010 ICICA 2010: Proceedings of the First International Conference on Information Computing and Applications, .
[3] M. Ijaz, A. U. Khan, S. M. Asim, Y. Hayat, S. H. Shah, M. Farooq, K. Ali, Neelam, S. Zubair, "A new ratio type estimator to estimate the population mean using auxiliary information," Advances and Applications in Statistics, vol. 63 no. 1, pp. 97-108, DOI: 10.17654/as063010097, 2020.
[4] M. Ijaz, T. Zaman, H. Bulut, A. Ullah, S. M. Asim, "An improved class of regression estimators using the auxiliary information," Journal of Science and Arts, vol. 20 no. 4, pp. 789-800, DOI: 10.46939/j.sci.arts-20.4-a01, 2020.
[5] M. Ijaz, A. Ullah, T. Zaman, "On the development of the ratio type estimators using auxiliary information," Journal of Science and Arts, vol. 21 no. 1, pp. 163-170, DOI: 10.46939/j.sci.arts-21.1-a14, 2021.
[6] M. I. Jeelani, S. Maqbool, S. A. Mir, International Journal of Modern Mathematical Sciences, vol. 6, 2013.
[7] IBEP, "Improved Ratio Estimators of Variance Based on Robust," 2004. https://www.os.servatorio.nazionaleri-uti.it/elencodocpub.asp?A.tipodoc=6
[8] M. Abid, S. Ahmad, M. Tahir, H. Z. Nazir, M. Riaz, "Improved ratio estimator of variance based on robust measures," Journal of Science Iranica, vol. 26 no. 4, pp. 2484-2494, 2019.
[9] W. G. Cochran, 1940.
[10] C. Kadilar, H. Cingi, "Ratio estimators in simple random sampling," Applied Mathematics and Computation, vol. 151 no. 3, pp. 893-902, DOI: 10.1016/s0096-3003(03)00803-8, 2004.
[11] H. Koç, "Ratio-type estimators for improving mean estimation using Poisson regression method," Communications in Statistics - Theory and Methods, vol. 50 no. 20, pp. 4685-4691, DOI: 10.1080/03610926.2020.1777307, 2021.
[12] C. Kadılar, M. Candan, H. Cingi, "Ratio estimators using robust regression. Hacettepe," Journal of Mathematics and Statistics, vol. 36 no. 2, pp. 181-188, 2007.
[13] E. Oral, C. Kadilar, "Robust ratio-type estimators in simple random sampling," Journal of the Korean Surgical Society, vol. 40 no. 4, pp. 457-467, DOI: 10.1016/j.jkss.2011.04.001, 2011.
[14] T. Zaman, "Improvement of modified ratio estimators using robust regression methods," Applied Mathematics and Computation, vol. 348, pp. 627-631, DOI: 10.1016/j.amc.2018.12.037, 2019.
[15] T. Zaman, H. Bulut, "Modified ratio estimators using robust regression methods," Communications in Statistics - Theory and Methods, vol. 48 no. 8, pp. 2039-2048, DOI: 10.1080/03610926.2018.1441419, 2019.
[16] T. Zaman, H. Bulut, "Modified regression estimators using robust regression methods and covariance matrices in stratified random sampling," Communications in Statistics - Theory and Methods, vol. 49 no. 14, pp. 3407-3420, DOI: 10.1080/03610926.2019.1588324, 2020.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2022 Muhammad Ijaz et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
In real-world situations, the data set under examination may contain uncommon noisy measurements that unreasonably affect the data’s outcome and produce incorrect model estimates. Practitioners employed robust-type estimators to reduce the weight of the noisy measurements in a data set in such a scenario. Using auxiliary information that will produce reliable estimates, we have looked at a few flexible robust-type estimators in this study. In order to estimate the population mean, this study presents unique flexible robust regression type ratio estimators that take into account the data from the midrange and interdecile range of the auxiliary variables. Up to the first order of approximate computation, the bias and mean square were calculated. In order to compare the flexibility of the proposed estimator to those of the existing estimators, theoretical conditions were also obtained. We took into account data sets containing outliers for empirical computation, and it was found that the suggested estimators produce results with higher precision than the existing estimators.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Department of Mathematics and Statistics, University of Haripur, Haripur, Pakistan
2 Department of Statistics, University of Peshawar, Peshawar, Pakistan
3 College of Engineering and Technology, American University of the Middle East, Kuwait