Options
HamBam — The Hamedan-Bamberg Corpus of Contemporary Spoken Persian
Faculty
Contributor(s):
Publisher Information:
Otto-Friedrich-Universität Bamberg
Year of publication:
2025
Language:
Multilingual/Other
Abstract:
HamBam, the Hamedan-Bamberg Corpus of Contemporary Spoken Persian (Haig & Rasekh-Mahand 2022), is an unrestrictedly accessible online corpus of contemporary spoken Persian. The design of the corpus follows the architecture and rationale of Multi-CAST (Haig & Schnell 2015), but with certain modifications. As in Multi-CAST, the texts are annotated using the free annotation software ELAN, which links sound files to annotation files. The annotated data are available in various formats (sound files, ELAN annotation files, tab-separated value files, and XML). This archive contains version 3.0 of the corpus (published in October 2025), which has been edited and expanded with six additional recordings. It fully supersedes all earlier versions.
HamBam at a glance
number of individual recordings: 44
total runtime: 166 minutes
total grammatical words: 20000
The HamBam team
Geoffrey Haig
Mohammad Rasekh-Mahand
Elham Izadi
Fariba Sabouri
Maryam Pouyankhah
Iran Abdi
Mehdi Parizadeh
Mehrdad Meshkinfam
Laurentia Schreiber
N. Schiborr
Citation
Haig, Geoffrey & Rasekh-Mahand, Mohammad. 2022. HamBam: Hamedan-Bamberg Corpus of Contemporary Spoken Persian. Version 3.0. (DOI: 10.48564/unibafd-v80bg-h0243)
HamBam at a glance
number of individual recordings: 44
total runtime: 166 minutes
total grammatical words: 20000
The HamBam team
Geoffrey Haig
Mohammad Rasekh-Mahand
Elham Izadi
Fariba Sabouri
Maryam Pouyankhah
Iran Abdi
Mehdi Parizadeh
Mehrdad Meshkinfam
Laurentia Schreiber
N. Schiborr
Citation
Haig, Geoffrey & Rasekh-Mahand, Mohammad. 2022. HamBam: Hamedan-Bamberg Corpus of Contemporary Spoken Persian. Version 3.0. (DOI: 10.48564/unibafd-v80bg-h0243)
Type:
Collection
Keywords: ;
corpus
spoken persian
Version:
3.0
Permalink
https://fis.uni-bamberg.de/handle/uniba/111027