Abstract

We consider the problem of constructing a synthetic sample from a population of interest which cannot be sampled from but for which the population means of some of its variables are known. In addition, we assume that we have in hand samples from two similar populations. Using the known population means, we will select subsamples from the samples of the other two populations which we will then combine to construct the synthetic sample. The synthetic sample is obtained by solving an optimization problem, where the known population means, are used as constraints. The optimization is achieved through an adaptive random search algorithm. Simulation studies are presented to demonstrate the effectiveness of our approach. We observe that on average, such synthetic samples behave very much like actual samples from the population of interest. As an application we consider constructing a one-percent synthetic sample for the missing 1890 decennial sample of the United States.

Details

Title
Constructing Synthetic Samples
Author
Dong, Hua; Meeden, Glen
Pages
113-127
Publication year
2016
Publication date
2016
Publisher
Statistics Sweden (SCB)
ISSN
0282423X
e-ISSN
20017367
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1821868663
Copyright
Copyright Statistics Sweden (SCB) 2016