A new Name-Based Sampling Method for Migrants using n-grams
Faculty/Professorship: | Survey Methodology |
Author(s): | Schnell, Rainer; Gramlich, Tobias; Bachteler, Tobias; Reiher, Jörg; Trappmann, Mark ; Smid, Menno; Becher, Inna |
Corporate Body: | German Record-Linkage Center |
Publisher Information: | Nürnberg |
Year of publication: | 2013 |
Pages: | 29 ; Illustrationen |
Series ; Volume: | Working paper series, 2013-04 |
Language(s): | English |
URL: | http://www.iab.de/389/section.aspx/Publikation/... |
Abstract: | The set of best methods for sampling migrant populations includes name-based sampling. So far this is done using either ad hoc lists or onomastic dictionaries for the classification of names. This paper proposes a new name-based procedure which uses a Bayes-classifier for the n-grams of the name. The new procedure is fault-tolerant of alternate spellings, and also allows the classification of names that are not found in dictionaries. It was tested using the names of about 1.600 foreigners in the PASS panel. Finally, a CATI survey based on the new method in Hesse (Germany) is described. |
Keywords: | Onomastics, rare populations, bigrams, trigrams |
Type: | Workingpaper |
URI: | https://fis.uni-bamberg.de/handle/uniba/1960 |
Year of publication: | 12. August 2013 |

originated at the
University of Bamberg
University of Bamberg