Options
Selection of models and variables : an automated Bayesian view for handling missing values
Bergrab, Michael (2026): Selection of models and variables : an automated Bayesian view for handling missing values, Bamberg: Otto-Friedrich-Universität, doi: 10.20378/irb-113953.
Faculty/Chair:
Leibniz-Institut für Bildungsverläufe (LIfBi)
Author:
Publisher Information:
Year of publication:
2026
Pages:
Supervisor:
Language:
English
Remark:
Dissertation, Otto-Friedrich-Universität Bamberg, 2026
DOI:
Abstract:
This dissertation addresses the challenges of model and variable selection in statistical analysis, with a particular focus on datasets with missing values. Adopting a Bayesian framework, it introduces an automated approach that explicitly accounts for uncertainty due to missing data while enabling robust selection of relevant variables. The core methodology integrates data augmentation and Markov Chain Monte Carlo (MCMC) techniques to jointly estimate the posterior distributions of parameters and missing values.
The work bridges theoretical foundations of Bayesian inference with practical applications, including the use of spike-and-slab priors for variable selection and the analysis of binary regression models with missing covariates. Empirical studies and simulations demonstrate the superiority of this approach over classical methods such as multiple imputation or shrinkage estimators (e.g., Lasso, Ridge, Elastic Net). The results indicate that the Bayesian approach not only enhances estimation accuracy but also improves model interpretability; especially in high-dimensional and high-missingness scenarios.
A key contribution of this work is its systematic approach to model selection, which relies on Bayes factors, marginal likelihoods, and information criteria (e.g., BIC) to compare competing models, even in the presence of incomplete data. These methods identify the model that strikes the optimal balance between fit and complexity, while model averaging and hierarchical priors ensure stability across different missingness mechanisms (MCAR, MAR, MNAR).
This dissertation contributes to statistical methodology by providing a coherent and flexible solution for the simultaneous handling of missing values, variable selection, and model comparison. It emphasizes the importance of accounting for uncertainty in data analysis and offers practical guidelines for implementation in empirical research.
The work bridges theoretical foundations of Bayesian inference with practical applications, including the use of spike-and-slab priors for variable selection and the analysis of binary regression models with missing covariates. Empirical studies and simulations demonstrate the superiority of this approach over classical methods such as multiple imputation or shrinkage estimators (e.g., Lasso, Ridge, Elastic Net). The results indicate that the Bayesian approach not only enhances estimation accuracy but also improves model interpretability; especially in high-dimensional and high-missingness scenarios.
A key contribution of this work is its systematic approach to model selection, which relies on Bayes factors, marginal likelihoods, and information criteria (e.g., BIC) to compare competing models, even in the presence of incomplete data. These methods identify the model that strikes the optimal balance between fit and complexity, while model averaging and hierarchical priors ensure stability across different missingness mechanisms (MCAR, MAR, MNAR).
This dissertation contributes to statistical methodology by providing a coherent and flexible solution for the simultaneous handling of missing values, variable selection, and model comparison. It emphasizes the importance of accounting for uncertainty in data analysis and offers practical guidelines for implementation in empirical research.
GND Keywords: ;
Bayes-Verfahren
Fehlende Daten
Keywords: ; ; ;
Bayesian estimation
Handling missing values
Variable selection
Model selection
DDC Classification:
RVK Classification:
Type:
Doctoralthesis
Activation date:
March 16, 2026
Permalink
https://fis.uni-bamberg.de/handle/uniba/113953