German Parliamentary Speeches
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/10895275
Description:
These datasets are part of the thesis entitled "A natural language processing analysis of parliamentary speeches in the German Bundestag from 1949 to 2023 with regards to gender equality and women's politics". The thesis contrasts the thematic preferences of men and women in their speeches, examines how each group engages with women's political issues, and traces how both have developed historically. It also analyzes how speakers are received in parliament and examines whether speech style can be distinguished by gender. The data is available as pandas DataFrames (.pkl files).
The plenary protocol data has been extracted from the following two sources:
Blaette, Andreas (2017): GermaParl. Corpus of Plenary Protocols of the German Bundestag. TEI files, available at: https://github.com/PolMine/GermaParlTEI
Deutscher Bundestag. Open Data - Plenarprotokolle der 20. Wahlperiode und Stammdaten aller Abgeordneten seit 1949. https://www.bundestag.de/services/opendata
The following datasets are contained:
data_merged_revised.pkl: DataFrame containing the extracted and cleaned data from the sources mentioned above
data_topics_revised.pkl: DataFrame containing the data from the sources mentioned above and the topic distributions after topic modelling
data_undersampled_processed.pkl: Undersampled and preprocessed dataset which can be used for training classification models
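Since the files are pickled pandas DataFrames, loading one could look roughly like the sketch below. It assumes the file has been downloaded from the Zenodo record into the working directory; the helper names `load_speeches` and `describe` are hypothetical and not part of the dataset, and no column names are assumed because the listing does not document them. Unpickling can execute arbitrary code, so only files from a trusted source should be loaded this way.

```python
from pathlib import Path

import pandas as pd


def load_speeches(path):
    """Load one of the pickled DataFrames and verify its type (hypothetical helper)."""
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"Dataset file not found: {path}")
    # Unpickling can run arbitrary code: only load files from a trusted source.
    df = pd.read_pickle(path)
    if not isinstance(df, pd.DataFrame):
        raise TypeError(f"Expected a pandas DataFrame, got {type(df).__name__}")
    return df


def describe(df):
    """Return a small summary without assuming any particular column names."""
    return {"rows": len(df), "columns": list(df.columns)}


if __name__ == "__main__":
    # Assumption: the file sits in the current working directory.
    target = Path("data_merged_revised.pkl")
    if target.is_file():
        print(describe(load_speeches(target)))
    else:
        print(f"{target} not found; download it from the Zenodo record first.")
```

Inspecting the returned column list first is a reasonable way to find out which fields (speaker, date, topic distribution, and so on) each file actually contains before building on them.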
Created:
2024-03-29



