five

Repeated Survey Responses of Large Language Models: A Dataset for Measuring Intra-Model Variability and Ideological Bias (Russian-Language Context, Kazakhstan)

收藏
DataCite Commons2026-05-02 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.19980770
下载链接
链接失效反馈
官方服务:
资源简介:
Description This dataset contains repeated responses generated by large language models (LLMs) to a standardized questionnaire designed to measure ideological orientations and value-based attitudes. The data were collected within an experimental framework aimed at assessing both inter-model differences and intra-model variability under controlled conditions. The dataset includes responses from three models: ChatGPT (n = 101), Alice (n = 99), and Copilot (n = 32). All interactions were conducted in Russian using identical prompts, ensuring comparability across models and iterations. Each model completed the same set of scale-based questions multiple times, allowing for the analysis of response dispersion, stability, and distributional characteristics. The questionnaire consists of 12 indicators (Q1–Q10c) capturing attitudes toward key socio-political dimensions, including democracy, leadership, redistribution, state responsibility, law compliance, traditions, national identity, and institutional trust. The dataset is structured in a tabular format, where each row represents a single model run and each column corresponds to a survey item. The primary purpose of the dataset is to enable the study of: intra-model variability (consistency of responses within the same model), inter-model differences (systematic variation across models), distributional properties of responses beyond mean values, potential ideological bias and its stability across repeated elicitation. The dataset supports quantitative analysis using nonparametric statistical methods and resampling techniques, including the estimation of variability indices such as the Intra-Model Variability (IMV) metric and bootstrap-based confidence intervals. This resource is particularly relevant for research in computational social science, AI evaluation, and algorithmic bias, providing an empirical basis for assessing reproducibility and stability in generative model outputs. The approach of repeated survey elicitation offers a replicable framework for examining how LLMs respond to socially sensitive content over multiple interactions, addressing limitations of single-response evaluation designs.
提供机构:
Zenodo
创建时间:
2026-05-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作