SVKCorp: Corpus of Debates in the National Council of the Slovak Republic

Indexed from NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/7020474
Description:
This repository holds a corpus of transcripts of parliamentary debates in the National Council of the Slovak Republic (https://www.nrsr.sk/web/). Transcripts of speeches were text-mined and cleaned from machine-readable Word documents and the official online database of the Parliament. The corpus covers the period 1994-2023 and comprises eight complete terms with 437 628 speeches. The corpus is stored on a per-term basis in two file formats: .RDS (SK_term_*.RDS) and .tsv (SK_term_*.tsv). For further information on the data structure, see the accompanying codebook. If you use the dataset, please cite it with the meta link for the whole repository: Mochtak, Michal (2024): SVKCorp: Corpus of Debates in the National Council of the Slovak Republic, v2 (1994-2023), https://doi.org/10.5281/zenodo.7020474.

Changelog:
2.0.0 (26/03/2024; latest version): Term 8 added; outdated CoNLL-U files removed
1.0.0 (24/08/2022): Original upload
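
As a rough illustration of working with the per-term .tsv files, the sketch below counts the data rows in every file matching SK_term_*.tsv in a local folder. It assumes the files have been downloaded already, are UTF-8 encoded, start with a header row, and hold one speech per row. None of this is confirmed here, so check the codebook first. The function name count_speeches_per_term is made up for this example.

```python
import csv
from pathlib import Path


def count_speeches_per_term(data_dir):
    """Return {file name: number of data rows} for each SK_term_*.tsv file.

    Assumptions (not confirmed by the dataset description): UTF-8 encoding,
    a header row, tab-separated fields, one speech per row.
    """
    counts = {}
    for path in sorted(Path(data_dir).glob("SK_term_*.tsv")):
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle, delimiter="\t")
            # DictReader consumes the header, so only data rows are counted.
            counts[path.name] = sum(1 for _ in reader)
    return counts
```

Calling count_speeches_per_term("path/to/download") returns a dictionary keyed by file name. If very long speeches trigger a field-size error, raising the limit with csv.field_size_limit may help. The .RDS files are R serializations and are most simply read in R with readRDS.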
Created:
2024-07-06