Options
Modeling and estimating income data in the presence of distinctive zero and heaped responses
Würbach, Ariane (2016): „Modeling and estimating income data in the presence of distinctive zero and heaped responses“. Bamberg: opus.
Author:
Publisher Information:
Year of publication:
2016
Pages:
Supervisor:
Language:
English
Remark:
Dissertation, Otto-Friedrich-Universität Bamberg, 2016
Licence:
Abstract:
A major part of research data in the social sciences originates from survey interviews. Besides the issue of non-response, questions concerning the accuracy of self-reported data are important research objectives. The focus of this thesis is on heaping behavior in surveyed income data. Heaping, i.e. aberrant concentrations of response values at specific points of the range, is typical for retrospective data, when the respondent is either uncertain about the true value or hesitates to report. A theoretical framework and explanations for heaping are presented.
Measurements for heaping and appropriate strategies to cope with it are discussed afterwards. Heaped data are linked with a loss of information and hence are found to deteriorate effects on the macro- and micro-level. Therefore, exploration of the relationships between heaping behavior and personal as well as context information is valuable. This work provides descriptive evidence for heaping behavior in the income data of the German National Educational Panel Study (NEPS). The data at hand strongly support the assumption that heaping behavior is not stochastic but deterministic, i.e. whether and to which degree heaping occurs is not random. Respective determinants influencing heaping behavior are the response value itself and common socio-economic characteristics. Male, higher educated, and older respondents have a higher propensity to heap their income. Because of that, there is a necessity of adequately addressing this issue, e.g. by a modeling strategy which explicitly takes the non-randomness of the heaping behavior into consideration. According to this, a heaping model is introduced enabling to account for different heaping behaviors. The model is a mixture of two components, the latent distribution and the model for the heaping behavior. A zero-inflated log-normal distribution with a piecewise constant heaping mechanism is defined as base model. The generality and flexibility of the established model is outlined by several modifications and extensions, with respect to the latent distribution, the heaping pattern as well as the heaping mechanism. In the application, all proposed models are explored concerning their fit to the NEPS income data. Posterior predictive checks are used to access the overall fit of the models. This thesis also includes a comparative analysis of different random-walk Metropolis (RWM) algorithms with respect to their estimation accuracy and efficiency. Besides the original RWM algorithm, blocking and adaptive strategies are inquired into. The results indicate that blocking can greatly improve mixing and convergence of the RWM algorithm, in contrast to the adaptive schemes considered. The performance of the models is fairly good, however, large differences in estimation exist with respect to runtime and efficiency. These differences are mainly attributable to the model assumed and the selected specification of the RWM algorithm.
Measurements for heaping and appropriate strategies to cope with it are discussed afterwards. Heaped data are linked with a loss of information and hence are found to deteriorate effects on the macro- and micro-level. Therefore, exploration of the relationships between heaping behavior and personal as well as context information is valuable. This work provides descriptive evidence for heaping behavior in the income data of the German National Educational Panel Study (NEPS). The data at hand strongly support the assumption that heaping behavior is not stochastic but deterministic, i.e. whether and to which degree heaping occurs is not random. Respective determinants influencing heaping behavior are the response value itself and common socio-economic characteristics. Male, higher educated, and older respondents have a higher propensity to heap their income. Because of that, there is a necessity of adequately addressing this issue, e.g. by a modeling strategy which explicitly takes the non-randomness of the heaping behavior into consideration. According to this, a heaping model is introduced enabling to account for different heaping behaviors. The model is a mixture of two components, the latent distribution and the model for the heaping behavior. A zero-inflated log-normal distribution with a piecewise constant heaping mechanism is defined as base model. The generality and flexibility of the established model is outlined by several modifications and extensions, with respect to the latent distribution, the heaping pattern as well as the heaping mechanism. In the application, all proposed models are explored concerning their fit to the NEPS income data. Posterior predictive checks are used to access the overall fit of the models. This thesis also includes a comparative analysis of different random-walk Metropolis (RWM) algorithms with respect to their estimation accuracy and efficiency. Besides the original RWM algorithm, blocking and adaptive strategies are inquired into. The results indicate that blocking can greatly improve mixing and convergence of the RWM algorithm, in contrast to the adaptive schemes considered. The performance of the models is fairly good, however, large differences in estimation exist with respect to runtime and efficiency. These differences are mainly attributable to the model assumed and the selected specification of the RWM algorithm.
GND Keywords: ; ;
Einkommensverteilung
Häufigkeitsverteilung
Antwortverhalten
Keywords: ; ; ; ;
heaping
finite mixture model
random-walk Metropolis algorithm
block Metropolis-Hastings algorithm
adaptive MCMC
DDC Classification:
Type:
Doctoralthesis
Activation date:
March 3, 2017
Permalink
https://fis.uni-bamberg.de/handle/uniba/41034