five

Czech Parliamentary Speeches Dataset 2016-2023; full transcripts of plenary speeches from the Chamber of Deputies

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://doi.org/10.7910/DVN/FOQUZF
下载链接
链接失效反馈
官方服务:
资源简介:
The Czech Parliamentary Speeches Dataset contains full-text vectors of plenary speeches in the Czech Chamber of Deputies from 2016 to 2023. The dataset was built as an extension of The ParlSpeech V2 by Rauh & Schwalbach (2020), where the Czech data end in June of 2016, to provide the remaining corpora of Czech speeches. The dataset contains variables with the entire transcript, speaker delivering the speech, political party, binary indication of whether he is the chair or not, monthyear and year. The dataset is cleaned of missing values, the text column is lowered for quantitative text analysis and the column monthyear and year are added for time series analysis. The dataset is intented mainly for quantitative text analysis for researchers in political science, sociology, media studies, law, etc. But in open science the possibilities are (almost) endless. The data were scraped directly from the Chamber of Deputies website in part thanks to codes by Ondrej Kokes reposited at https://github.com/kokes/od. I would like to thus thank Ondrej Kokes, as without his commitments to open science, the assembly of the dataset would have taken much longer.
创建时间:
2025-10-18
二维码
社区交流群
二维码
科研交流群
商业服务