How can high-tech manufacturing achieve high innovation productivity? A configurational path analysis under the TOE framework
Source: DataCite Commons. Updated: 2026-03-20. Indexed: 2026-05-04.
Download link:
https://data.mendeley.com/datasets/hn9k7f2k99
Resource description:
Dataset and Code for: Configurational Pathways to Innovation Productivity: A Dynamic TOE-QCA Analysis of High-Tech Manufacturing
Description:
1. What is this dataset?
This repository contains the comprehensive dataset and original execution scripts (in R and Python) supporting the dynamic Qualitative Comparative Analysis (QCA) of high-tech manufacturing innovation productivity in China. It provides all necessary materials to fully reproduce the configurational path analysis, temporal trend visualizations, industry heterogeneity evaluations, and out-of-sample predictive validity tests presented in the manuscript based on the Technology-Organization-Environment (TOE) framework.
2. How was this dataset collected?
The raw panel data were collected from Chinese A-share listed high-tech manufacturing firms covering the period from 2015 to 2024. Financial and patent data were sourced from authoritative databases including CSMAR and WIND.
3. What files are included?
The repository consists of six core files, listed below, to support transparency and reproducibility:
PANELDATA.csv: The primary panel dataset containing the foundational data for the analytical sample, used as the main input for the dynamic QCA process.
DYNAMIC.R: The core R script utilizing the QCA and admisc packages. It executes the fuzzy-set calibration, necessity and sufficiency analyses (truth table minimization), and computes both between-group and within-group consistencies across different industry configurations.
Calibrated_Data.csv: The fully calibrated fuzzy-set dataset exported from the main QCA procedure, serving as the direct input for the out-of-sample testing.
Out-of-Sample Predictive Validity Test.py: A Python script utilizing pandas and seaborn to perform predictive validity testing on a holdout sample (2020-2024). It calculates the consistency and coverage of the specific configurations and automatically generates scatter plots for validation.
plot_data.csv: A dataset extracted and formatted from the QCA clustering results, used solely to generate the temporal trend lines.
photo.R: An R script utilizing the ggplot2 package to read plot_data.csv and visualize the intertemporal evolutionary trends of configurational consistency over the decade.
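The consistency and coverage figures mentioned for the out-of-sample test follow the standard fuzzy-set sufficiency formulas: consistency is the sum of min(x, y) divided by the sum of x, and coverage is the same numerator divided by the sum of y, where x is membership in a configuration and y is membership in the outcome. The sketch below illustrates that calculation on a toy holdout window. It is not the repository's script: the column names (TECH, ORG, ENV, OUTCOME, year), the toy values, and the example configuration are all hypothetical.

```python
import pandas as pd


def configuration_membership(df, present, absent=()):
    """Fuzzy AND over conditions: row-wise minimum.

    Conditions listed in `absent` are negated as 1 - x before taking the minimum.
    """
    parts = [df[c] for c in present] + [1 - df[c] for c in absent]
    return pd.concat(parts, axis=1).min(axis=1)


def sufficiency_fit(x, y):
    """Return (consistency, coverage) of 'x is sufficient for y'."""
    overlap = pd.concat([x, y], axis=1).min(axis=1).sum()
    return overlap / x.sum(), overlap / y.sum()


# Toy calibrated data; every name and value here is an assumption for illustration.
toy = pd.DataFrame({
    "year":    [2019, 2020, 2021, 2022],
    "TECH":    [0.9, 0.8, 0.6, 0.2],
    "ORG":     [0.7, 0.9, 0.4, 0.3],
    "ENV":     [0.2, 0.1, 0.5, 0.9],
    "OUTCOME": [0.8, 0.7, 0.5, 0.4],
})

# Keep only the holdout years (the description names 2020-2024 as the holdout).
holdout = toy[toy["year"].between(2020, 2024)]

# Hypothetical configuration: TECH * ORG * ~ENV.
x = configuration_membership(holdout, present=["TECH", "ORG"], absent=["ENV"])
consistency, coverage = sufficiency_fit(x, holdout["OUTCOME"])
print(f"consistency={consistency:.3f}, coverage={coverage:.3f}")
```

To apply the same functions to Calibrated_Data.csv, the column names would have to be replaced with those actually used in that file.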
4. How can this dataset be used?
Researchers and reviewers can download this complete package into a single local directory to achieve "plug-and-play" reproducibility. By running the R and Python scripts sequentially, users can replicate the exact configurational pathways, robustness checks, and high-quality figures discussed in the study. Furthermore, it serves as a methodological template for scholars intending to integrate dynamic QCA with machine-learning-inspired out-of-sample prediction in management research.
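Before running anything, it can help to confirm that all six files sit in the same local directory. The following is a minimal sketch of such a check. The file names are taken from the list above; the helper name missing_files and the suggested run order are assumptions, inferred from which script reads which file, and are not stated in the repository.

```python
from pathlib import Path

# File names as listed in the dataset description.
EXPECTED_FILES = [
    "PANELDATA.csv",
    "DYNAMIC.R",
    "Calibrated_Data.csv",
    "Out-of-Sample Predictive Validity Test.py",
    "plot_data.csv",
    "photo.R",
]

# Assumed order: main QCA first, then the holdout test, then the trend plot.
RUN_ORDER = [
    "DYNAMIC.R",
    "Out-of-Sample Predictive Validity Test.py",
    "photo.R",
]


def missing_files(directory):
    """Return the expected file names that are not present in `directory`."""
    root = Path(directory)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_files(".")
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All files found. Suggested order:", " -> ".join(RUN_ORDER))
```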
Provider:
Mendeley Data
Created:
2026-03-20



