five

German Parliamentary Speeches

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10895275
下载链接
链接失效反馈
官方服务:
资源简介:
These datasets are part of the thesis entitled "A natural language processing analysis of parliamentary speeches in the German Bundestag from 1949 to 2023 with regards to gender equality and women's politics". The focus of the thesis lies on contrasting the thematic preferences of men and women, as well as their connection to women's political issues based on their speeches and highlighting their historical development. Furthermore, the speakers’ reception in parliament is analyzed and the gender-specific distinguishability of speech style is verified. The data is available in a pandas DataFrame format.   The plenary protocol data has been extracted from the following two sources: Blaette, Andreas (2017): GermaParl. Corpus of Plenary Protocols of the German Bundestag. TEI files, availables at: https://github.com/PolMine/GermaParlTEI Deutscher Bundestag. Open Data - Plenarprotokolle der 20. Wahlperiode und Stammdaten aller Abgeordneten seit 1949. https://www.bundestag.de/services/opendata   The following datasets are contained: data_merged_revised.pkl: DataFrame containing the extracted and cleaned data from the sources mentioned above  data_topics_revised.pkl: DataFrame containing the data from the sources mentioned above and the topic distributions after topic modelling data_undersampled_processed.pkl: Undersampled and preprocessed dataset which can be used for training classification models
创建时间:
2024-03-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作