basqueparl
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/hitz/basqueparl
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含大量巴斯克语-西班牙语代码转换的双语巴斯克议会发言转录语料库,它丰富了与发言人和演讲相关的元数据信息。这些元数据包括语言、性别和政治党派等详细信息,使得研究者能够对语言使用随时间的变化以及按人口统计因素进行的详细分析。该数据集的规模覆盖了议会演讲,其任务在于分析双语背景下政治话语和语言使用情况。
This dataset is a bilingual transcribed corpus of Basque parliamentary speeches containing extensive Basque-Spanish code-switching instances. It enriches the metadata associated with individual speakers and their delivered speeches, with detailed attributes including the language(s) employed, speaker gender, and political party affiliation. Such comprehensive metadata allows researchers to perform in-depth analyses of temporal variations in language use as well as targeted studies based on demographic factors. Covering a substantial volume of parliamentary speech content, this dataset is developed to support research on political discourse and language usage within bilingual contexts.



