Data Files

Name: Data Files
Creator: figshare
Published: 2026-04-09 06:05:04
License: 暂无描述

DataCite Commons2026-04-09 更新2026-04-25 收录

下载链接：

https://figshare.com/articles/dataset/Data_Files/30353269

下载链接

链接失效反馈

官方服务：

资源简介：

This study examines the fidelity of datasets generated by generative AI (GenAI) tools in comparison to real-world survey data, addressing methodological implications for computational social science. Using an original survey as a benchmark, we created three datasets: authentic respondent data, GenAI-simulated datasets produced through programmed distributional prompts, and GenAI-reconstructed datasets designed to replicate plausible human responses. Both artificial datasets were generated using large language models, specifically ChatGPT and Perplexity. Comparative analyses focused on distributional accuracy, reliability, and hypothesis-testing validity. Findings show that GenAI-simulated datasets approximated statistical distributions but lacked natural response heterogeneity, whereas GenAI-reconstructed datasets captured contextual patterns and nuanced correlations but diverged from exact statistical parameters. These results highlight the distinct yet complementary roles of AI-driven simulation and reconstruction for replication and robustness testing. The study contributes to discussions on reproducibility, credibility, and ethical integration of GenAI in high-impact social science research.

提供机构：

figshare

创建时间：

2025-10-14

5,000+

优质数据集

54 个

任务类型

进入经典数据集