Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”

Name: Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”
Creator: bonndata
Published: 2025-11-11 12:47:45
License: 暂无描述

DataCite Commons2025-11-11 更新2026-02-08 收录

下载链接：

https://bonndata.uni-bonn.de/citation?persistentId=doi:10.60507/FK2/BK14F8

下载链接

链接失效反馈

官方服务：

资源简介：

This repository contains all data and scripts used for the study “Project 2025 as a Technocratic Blueprint: A Corpus-Based Linguistic Analysis of Conservative Governance Discourse” (Schilling & Fuchs, 2025). The study investigates the language of the Heritage Foundation’s Project 2025, a 900-page conservative policy blueprint, using methods from Corpus-Assisted Discourse Studies (CADS), Political Discourse Analysis (PDA), and psycholinguistic text analysis. The corpus includes Project 2025 and Democratic and Republican Party platforms (2016–2024). The dataset includes: Raw data (/data/raw_data/): full texts of Project 2025 and party platforms in CSV format. Processed data (/data/raw_data/Project2025_lemmaPOS.csv): tokenized, lemmatized, and POS-tagged text. Keyness results (/data/keyness/): unigram and bigram keyness calculations (log-likelihood, log-ratio). Collocation results (/data/collocations/): top 10 adjective, noun, and verb collocates per node. LIWC results (/data/liwc/): LIWC-22 category scores for each corpus. Analysis scripts (/code/): R Markdown file (analysis_project2025.Rmd) and two Python scripts for collocation and keyness analysis. All files are in UTF-8 plain-text format. The dataset contains no personal, sensitive, or proprietary data and derives entirely from publicly accessible political documents.

提供机构：

bonndata

创建时间：

2025-11-05