five

Czech Parliamentary Speeches Dataset 1993-2023; full transcripts of plenary speeches from the Chamber of Deputies

收藏
DataONE2024-05-11 更新2024-10-19 收录
下载链接:
https://search.dataone.org/view/sha256:43382ea1260f73fbd63e75b9f54f4a54e5dbcf5eed941714305abae68b29286d
下载链接
链接失效反馈
官方服务:
资源简介:
The Czech Parliamentary Speeches Dataset contains full-text vectors of 470 007 plenary speeches in the Czech Chamber of Deputies during its entire existence from 1993 to 2023. The dataset contains variables with the entire transcript, speaker delivering the speech, political party, binary indication of whether he is the chair or not, monthyear and year. The dataset is cleaned of missing values, the text column is lowered for quantitative text analysis and the column monthyear and year are added for time series or TSCS analysis. The dataset is intented mainly for quantitative text analysis for researchers in political science, sociology, media studies, law, etc. But in open science the possibilities are (almost) endless. The dataset is in part an extension of The ParlSpeech V2 by Rauh & Schwalbach (2020), parts of which were altered, updated and used to create the early years of this dataset. The later years were scraped directly from the Chamber of Deputies website in part thanks to a script by Ondrej Kokes https://github.com/kokes/od, to provide open access to data about the the Czech public financing and governance. I would like to thus thank Rauh & Schwalbach and Kokes, without which I would not have been able to construct this dataset. I claim only small original inputs as the main contribution is meant to be in updating and compiling existing data into a clean and analysis ready dataset.
创建时间:
2024-09-24
二维码
社区交流群
二维码
科研交流群
商业服务