llm_global_opinions

Name: llm_global_opinions
Creator: maas
Published: 2025-12-05 16:23:18
License: 暂无描述

魔搭社区2025-12-05 更新2025-02-15 收录

下载链接：

https://modelscope.cn/datasets/Anthropic/llm_global_opinions

下载链接

链接失效反馈

官方服务：

资源简介：

# Dataset Card for GlobalOpinionQA ## Dataset Summary The data contains a subset of survey questions about global issues and opinions adapted from the [World Values Survey](https://www.worldvaluessurvey.org/) and [Pew Global Attitudes Survey](https://www.pewresearch.org/). The data is further described in the paper: [Towards Measuring the Representation of Subjective Global Opinions in Language Models](https://arxiv.org/abs/2306.16388). ## Purpose In our paper, we use this dataset to analyze the opinions that large language models (LLMs) reflect on complex global issues. Our goal is to gain insights into potential biases in AI systems by evaluating their performance on subjective topics. ## Data Format The data is in a CSV file with the following columns: - question: The text of the survey question. - selections: A dictionary where the key is the country name and the value is a list of percentages of respondents who selected each answer option for that country. - options: A list of the answer options for the given question. - source: GAS/WVS depending on whether the question is coming from Global Attitudes Survey or World Value Survey. ## Usage ```python from datasets import load_dataset # Loading the data dataset = load_dataset("Anthropic/llm_global_opinions") ``` ## Disclaimer We recognize the limitations in using this dataset to evaluate LLMs, as they were not specifically designed for this purpose. Therefore, we acknowledge that the construct validity of these datasets when applied to LLMs may be limited. ## Contact For questions, you can email esin at anthropic dot com ## Citation If you would like to cite our work or data, you may use the following bibtex citation: ``` @misc{durmus2023measuring, title={Towards Measuring the Representation of Subjective Global Opinions in Language Models}, author={Esin Durmus and Karina Nyugen and Thomas I. Liao and Nicholas Schiefer and Amanda Askell and Anton Bakhtin and Carol Chen and Zac Hatfield-Dodds and Danny Hernandez and Nicholas Joseph and Liane Lovitt and Sam McCandlish and Orowa Sikder and Alex Tamkin and Janel Thamkul and Jared Kaplan and Jack Clark and Deep Ganguli}, year={2023}, eprint={2306.16388}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```

# GlobalOpinionQA 数据集卡片 ## 数据集概述本数据集包含改编自[世界价值观调查（World Values Survey）](https://www.worldvaluessurvey.org/)与[皮尤全球态度调查（Pew Global Attitudes Survey）](https://www.pewresearch.org/)的全球议题与观点类调研问题子集。本数据集的详细说明已发表于论文：[《量化语言模型中主观全球观点的表征》（Towards Measuring the Representation of Subjective Global Opinions in Language Models）](https://arxiv.org/abs/2306.16388)。 ## 数据集用途在本研究论文中，我们使用该数据集分析大语言模型（Large Language Model，LLM）对复杂全球议题所反映出的观点。我们的目标是通过评估大语言模型在主观议题上的表现，探究人工智能系统中潜在的偏差问题。 ## 数据格式本数据集以CSV文件格式存储，包含以下字段： - `question`：调研问题的文本内容。 - `selections`：一个字典对象，键为国家名称，值为对应国家选择各答案选项的受访者占比列表。 - `options`：该调研问题的答案选项列表。 - `source`：标注为`GAS`或`WVS`，分别表示问题来源于皮尤全球态度调查（Global Attitudes Survey，GAS）或世界价值观调查（World Values Survey，WVS）。 ## 使用方法 python from datasets import load_dataset # 加载数据集 dataset = load_dataset("Anthropic/llm_global_opinions") ## 免责声明我们认识到使用本数据集评估大语言模型存在一定局限性，因为该数据集并非专门为这一评估场景设计。因此，我们承认将本数据集应用于大语言模型评估时，其结构效度可能存在局限。 ## 联系方式如有任何疑问，可发送邮件至 `esin@anthropic.com`。 ## 引用方式如需引用本研究或本数据集，请使用以下BibTeX格式的引用条目： @misc{durmus2023measuring, title={Towards Measuring the Representation of Subjective Global Opinions in Language Models}, author={Esin Durmus and Karina Nyugen and Thomas I. Liao and Nicholas Schiefer and Amanda Askell and Anton Bakhtin and Carol Chen and Zac Hatfield-Dodds and Danny Hernandez and Nicholas Joseph and Liane Lovitt and Sam McCandlish and Orowa Sikder and Alex Tamkin and Janel Thamkul and Jared Kaplan and Jack Clark and Deep Ganguli}, year={2023}, eprint={2306.16388}, archivePrefix={arXiv}, primaryClass={cs.CL} }

提供机构：

maas

创建时间：

2025-02-12

搜集汇总

数据集介绍