Content area

Abstract

This study investigates the determination of stratification points for two study variables within the framework of simple random sampling, with a focus on estimating the population mean using a closely related auxiliary variable. Employing a superpopulation model, the research aims to minimize overall variance by deriving simplified equations that enhance the precision of parameter estimates. Instead of categorizing variables, the study emphasizes continuous variables to establish optimal strata boundaries (OSB), which are essential for creating homogeneous groups within each stratum. This stratification leads to more efficient sample sizes (SS) and improved accuracy in parameter estimation. However, achieving optimal OSB and SS poses challenges in scenarios with a fixed total sample size, such as survey designs constrained by limited budgets. To address this, the study proposes a robust methodology for calculating OSB and SS, leveraging knowledge of the survey’s per-unit stratum measurement costs or its probability density function. An empirical application of the method is demonstrated using breast cancer data, where the mean perimeter is estimated based on mean radius and mean texture. Additionally, hypothetical examples using Cauchy and standard power distributions are provided to illustrate the versatility of the proposed approach. The newly developed method has been integrated into the updated stratifyR package and implemented in LINGO software, facilitating its practical application. Comparative analysis reveals that this approach consistently outperforms or matches existing methods in enhancing the precision of population parameter estimation. Furthermore, simulation studies confirm its higher relative efficiency, making it a valuable contribution to the field of stratified sampling.

Details

1009240
Business indexing term
Title
A design-based framework for optimal stratification using super-population models with application on real data set of breast cancer
Author
Publication title
PLoS One; San Francisco
Volume
20
Issue
5
First page
e0323619
Publication year
2025
Publication date
May 2025
Section
Research Article
Publisher
Public Library of Science
Place of publication
San Francisco
Country of publication
United States
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Milestone dates
2024-12-14 (Received); 2025-04-10 (Accepted); 2025-05-22 (Published)
ProQuest document ID
3206831876
Document URL
https://www.proquest.com/scholarly-journals/design-based-framework-optimal-stratification/docview/3206831876/se-2?accountid=208611
Copyright
© 2025 Faizan Danish. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-23
Database
ProQuest One Academic