HamBam — The Hamedan-Bamberg Corpus of Contemporary Spoken Persian
Source: DataCite Commons | Updated: 2025-10-14 | Indexed: 2026-05-03
Download link:
https://fd-repo.uni-bamberg.de/doi/10.48564/unibafd-kqx47-c8g48
Description:
HamBam, the Hamedan-Bamberg Corpus of Contemporary Spoken Persian (Haig & Rasekh-Mahand 2022), is a freely accessible online corpus of contemporary spoken Persian. The design of the corpus follows the architecture and rationale of Multi-CAST (Haig & Schnell 2015), with some modifications. As in Multi-CAST, the texts are annotated using the free annotation software ELAN, which links sound files to annotation files. The annotated data are available in various formats (sound files, ELAN annotation files, tab-separated value files, and XML). This archive contains version 3.0 of the corpus (published in October 2025), which has been edited and expanded with six additional recordings. It fully supersedes all earlier versions.
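Since the description says ELAN links sound files to annotation files, a short sketch of reading time-aligned annotations from an ELAN file (.eaf, an XML format) may be useful. The sample document, the tier name "utterance", and the helper read_aligned_annotations below are illustrative assumptions; the actual tier names and layout of the HamBam files are not given here and may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily reduced EAF document for illustration only.
# Real HamBam files carry more tiers and metadata, and the tier name
# "utterance" is an assumption, not taken from the corpus.
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
    <TIME_SLOT TIME_SLOT_ID="ts3" TIME_VALUE="3200"/>
  </TIME_ORDER>
  <TIER TIER_ID="utterance">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
          TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>first utterance</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2"
          TIME_SLOT_REF1="ts2" TIME_SLOT_REF2="ts3">
        <ANNOTATION_VALUE>second utterance</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def read_aligned_annotations(eaf_text, tier_id):
    """Return (start_ms, end_ms, text) tuples for one time-aligned tier."""
    root = ET.fromstring(eaf_text)

    # Map each time slot ID to its offset in milliseconds into the audio.
    slots = {
        slot.get("TIME_SLOT_ID"): int(slot.get("TIME_VALUE"))
        for slot in root.iter("TIME_SLOT")
        if slot.get("TIME_VALUE") is not None
    }

    rows = []
    for tier in root.iter("TIER"):
        if tier.get("TIER_ID") != tier_id:
            continue
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            start = slots.get(ann.get("TIME_SLOT_REF1"))
            end = slots.get(ann.get("TIME_SLOT_REF2"))
            text = ann.findtext("ANNOTATION_VALUE", default="")
            rows.append((start, end, text))
    return rows


for start, end, text in read_aligned_annotations(SAMPLE_EAF, "utterance"):
    print(start, end, text)
```

For a downloaded file, the same function could be given the file's contents read as text. The tab-separated value files are likely the simpler entry point for tabular analysis.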
HamBam at a glance
number of individual recordings: 44
total runtime: 166 minutes
total grammatical words: 20000
The HamBam team
Geoffrey Haig
Mohammad Rasekh-Mahand
Elham Izadi
Fariba Sabouri
Maryam Pouyankhah
Iran Abdi
Mehdi Parizadeh
Mehrdad Meshkinfam
Laurentia Schreiber
N. Schiborr
Citation
Haig, Geoffrey & Rasekh-Mahand, Mohammad. 2022. HamBam: Hamedan-Bamberg Corpus of Contemporary Spoken Persian. Version 3.0. (DOI: 10.48564/unibafd-v80bg-h0243)
Provider:
Otto-Friedrich-Universität Bamberg
Created:
2025-10-14



