Norwegian Parliamentary Speech Corpus
收藏SSH Open MarketPlace2025-07-04 更新2025-07-05 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/R5CxJp
下载链接
链接失效反馈官方服务:
资源简介:
This corpus consists of audio recordings of meetings in [Stortinget](https://www.stortinget.no/no/) (the Norwegian parliament), and corresponding orthographic transcriptions in either Norwegian Bokmål or Norwegian Nynorsk, as well as various metadata about the speakers. The official proceedings from the meetings are also included in the corpus for reference.
Transcription was first done automatically; subsequently, the output of the automatic process was manually checked and corrected by trained linguists and philologists. Finally, all transcriptions were proofread to ensure consistency and accuracy. The audio files in the corpus contain the speech of entire days of plenary meetings from 2017 and 2018 (or, if a meeting lasts more than six hours, the first six hours of a day).
The corpus is available for download from the Norwegian Language Bank.
本语料库收录了挪威议会(Stortinget)会议的音频录音,以及对应的挪威书面语(Norwegian Bokmål)或新挪威语(Norwegian Nynorsk)的正字法转写文本,同时包含与会者的各类元数据。此外,会议的官方记录也一并收录于本语料库以供参考。
转写工作首先通过自动化流程完成,随后由经过专业训练的语言学家与语文学家对自动转写结果进行人工校验与修正。最后,所有转写文本均经过校对,以确保内容的一致性与准确性。
本语料库中的音频文件涵盖2017年与2018年全体会议的单日全部发言内容;若单日会议时长超过六小时,则仅收录当日会议的前六小时音频。
该语料库可从挪威语言库(Norwegian Language Bank)下载获取。
创建时间:
2025-07-04



