five

Data Files

收藏
DataCite Commons2025-10-14 更新2026-04-25 收录
下载链接:
https://figshare.com/articles/dataset/Data_Files/30353269/1
下载链接
链接失效反馈
官方服务:
资源简介:
This study examines the fidelity of datasets generated by generative AI (GenAI) tools in comparison to real-world survey data, addressing methodological implications for computational social science. Using an original survey as a benchmark, we created three datasets: authentic respondent data, GenAI-simulated datasets produced through programmed distributional prompts, and GenAI-reconstructed datasets designed to replicate plausible human responses. Both artificial datasets were generated using large language models, specifically ChatGPT and Perplexity. Comparative analyses focused on distributional accuracy, reliability, and hypothesis-testing validity. Findings show that GenAI-simulated datasets approximated statistical distributions but lacked natural response heterogeneity, whereas GenAI-reconstructed datasets captured contextual patterns and nuanced correlations but diverged from exact statistical parameters. These results highlight the distinct yet complementary roles of AI-driven simulation and reconstruction for replication and robustness testing. The study contributes to discussions on reproducibility, credibility, and ethical integration of GenAI in high-impact social science research.
提供机构:
figshare
创建时间:
2025-10-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作