SpiCE: Speech in Cantonese and English

DataONE2021-05-20 更新2024-06-08 收录

下载链接：

https://search.dataone.org/view/sha256:260285d28e73c70813667f2ce1c5182c6cde3a45ea585d297fe03aea47395f45

下载链接

链接失效反馈

官方服务：

资源简介：

This is the Speech in Cantonese and English (SpiCE) corpus. SpiCE is an audio corpus of conversational Cantonese-English bilingual speech recorded in Vancouver, Canada during 2018-2020. The corpus includes high-quality recordings of 34 early bilinguals in both English and Cantonese. Participants completed a sentence reading task, storyboard narration, and conversational interview in each language. These different speech tasks are available in a single audio file for each language for each talker. A Praat textgrid file accompanies each audio file. The textgrids provide hand-corrected orthographic transcription and phoneme-level forced-alignment in Cantonese and English. As an open-access language resource, SpiCE will promote bilingualism research for a typologically distinct pair of languages, of which Cantonese remains understudied despite there being millions of speakers around the world. The SpiCE corpus is especially well-suited for phonetic research on conversational speech, and enables researchers to study cross-language within-speaker phenomena for a diverse group of early Cantonese-English bilinguals. These are areas with few existing high-quality resources. Corpus documentation is available at: https://spice-corpus.readthedocs.io/.

创建时间：

2023-12-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集