Improving the representativeness of a simple random sample: an optimization model and its application to the continuous sample of working lives
Mostra el registre complet de l'element
Visualització
(1.408Mb)
|
|
|
|
|
|
Núñez Antón, Vicente; Pérez-Salamero González, Juan Manuel; Regúlez Castillo, Marta; Vidal Meliá, Carlos
|
|
Aquest document és un/a article, creat/da en: 2020
|
|
|
|
This paper proposes an optimization model for selecting a larger subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is, therefore, NP-hard. However, the solution is found by maximizing the size of the subsample taken from a stratified random sample with proportional allocation and restricting it to a p-value large enough to achieve a good fit to the population of interest using Pearson’s chi-square goodness-of-fit test. The paper also applies the model to the Continuous Sample of Working Lives (CSWL), which is a set of anonymized microdata containing information on individuals from Spanish Social Security records and the results prove that it is possible to obtain a larger subsample from the CSWL that (far) better represents the pensioner population for each of the waves analyzed.
|
|
Veure al catàleg Trobes
|
|
|
Aquest element apareix en la col·lecció o col·leccions següent(s)
Mostra el registre complet de l'element