Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”
收藏DataCite Commons2025-11-11 更新2026-02-08 收录
下载链接:
https://bonndata.uni-bonn.de/citation?persistentId=doi:10.60507/FK2/BK14F8
下载链接
链接失效反馈官方服务:
资源简介:
This repository contains all data and scripts used for the study “Project 2025 as a Technocratic Blueprint: A Corpus-Based Linguistic Analysis of Conservative Governance Discourse” (Schilling & Fuchs, 2025).
The study investigates the language of the Heritage Foundation’s Project 2025, a 900-page conservative policy blueprint, using methods from Corpus-Assisted Discourse Studies (CADS), Political Discourse Analysis (PDA), and psycholinguistic text analysis. The corpus includes Project 2025 and Democratic and Republican Party platforms (2016–2024).
The dataset includes:
Raw data (/data/raw_data/): full texts of Project 2025 and party platforms in CSV format.
Processed data (/data/raw_data/Project2025_lemmaPOS.csv): tokenized, lemmatized, and POS-tagged text.
Keyness results (/data/keyness/): unigram and bigram keyness calculations (log-likelihood, log-ratio).
Collocation results (/data/collocations/): top 10 adjective, noun, and verb collocates per node.
LIWC results (/data/liwc/): LIWC-22 category scores for each corpus.
Analysis scripts (/code/): R Markdown file (analysis_project2025.Rmd) and two Python scripts for collocation and keyness analysis.
All files are in UTF-8 plain-text format. The dataset contains no personal, sensitive, or proprietary data and derives entirely from publicly accessible political documents.
提供机构:
bonndata
创建时间:
2025-11-05



