A new Name-Based Sampling Method for Migrants using n-grams



Faculty/Professorship: Survey Methodology  
Author(s): Schnell, Rainer; Gramlich, Tobias; Bachteler, Tobias; Reiher, Jörg; Trappmann, Mark ; Smid, Menno; Becher, Inna
Corporate Body: German Record-Linkage Center
Publisher Information: Nürnberg
Year of publication: 2013
Pages: 29 ; Illustrationen
Series ; Volume: Working paper series, 2013-04
Language(s): English
URL: http://www.iab.de/389/section.aspx/Publikation/...
Abstract: 
The set of best methods for sampling migrant populations includes
name-based sampling. So far this is done using either ad hoc lists
or onomastic dictionaries for the classification of names. This paper
proposes a new name-based procedure which uses a Bayes-classifier
for the n-grams of the name. The new procedure is fault-tolerant of
alternate spellings, and also allows the classification of names that are
not found in dictionaries. It was tested using the names of about 1.600
foreigners in the PASS panel. Finally, a CATI survey based on the new
method in Hesse (Germany) is described.
Keywords: Onomastics, rare populations, bigrams, trigrams
Type: Workingpaper
URI: https://fis.uni-bamberg.de/handle/uniba/1960
Year of publication: 12. August 2013