five

llm_global_opinions

收藏
魔搭社区2025-12-05 更新2025-02-15 收录
下载链接:
https://modelscope.cn/datasets/Anthropic/llm_global_opinions
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for GlobalOpinionQA ## Dataset Summary The data contains a subset of survey questions about global issues and opinions adapted from the [World Values Survey](https://www.worldvaluessurvey.org/) and [Pew Global Attitudes Survey](https://www.pewresearch.org/). The data is further described in the paper: [Towards Measuring the Representation of Subjective Global Opinions in Language Models](https://arxiv.org/abs/2306.16388). ## Purpose In our paper, we use this dataset to analyze the opinions that large language models (LLMs) reflect on complex global issues. Our goal is to gain insights into potential biases in AI systems by evaluating their performance on subjective topics. ## Data Format The data is in a CSV file with the following columns: - question: The text of the survey question. - selections: A dictionary where the key is the country name and the value is a list of percentages of respondents who selected each answer option for that country. - options: A list of the answer options for the given question. - source: GAS/WVS depending on whether the question is coming from Global Attitudes Survey or World Value Survey. ## Usage ```python from datasets import load_dataset # Loading the data dataset = load_dataset("Anthropic/llm_global_opinions") ``` ## Disclaimer We recognize the limitations in using this dataset to evaluate LLMs, as they were not specifically designed for this purpose. Therefore, we acknowledge that the construct validity of these datasets when applied to LLMs may be limited. ## Contact For questions, you can email esin at anthropic dot com ## Citation If you would like to cite our work or data, you may use the following bibtex citation: ``` @misc{durmus2023measuring, title={Towards Measuring the Representation of Subjective Global Opinions in Language Models}, author={Esin Durmus and Karina Nyugen and Thomas I. Liao and Nicholas Schiefer and Amanda Askell and Anton Bakhtin and Carol Chen and Zac Hatfield-Dodds and Danny Hernandez and Nicholas Joseph and Liane Lovitt and Sam McCandlish and Orowa Sikder and Alex Tamkin and Janel Thamkul and Jared Kaplan and Jack Clark and Deep Ganguli}, year={2023}, eprint={2306.16388}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```

# GlobalOpinionQA 数据集卡片 ## 数据集概述 本数据集包含改编自[世界价值观调查(World Values Survey)](https://www.worldvaluessurvey.org/)与[皮尤全球态度调查(Pew Global Attitudes Survey)](https://www.pewresearch.org/)的全球议题与观点类调研问题子集。 本数据集的详细说明已发表于论文:[《量化语言模型中主观全球观点的表征》(Towards Measuring the Representation of Subjective Global Opinions in Language Models)](https://arxiv.org/abs/2306.16388)。 ## 数据集用途 在本研究论文中,我们使用该数据集分析大语言模型(Large Language Model,LLM)对复杂全球议题所反映出的观点。我们的目标是通过评估大语言模型在主观议题上的表现,探究人工智能系统中潜在的偏差问题。 ## 数据格式 本数据集以CSV文件格式存储,包含以下字段: - `question`:调研问题的文本内容。 - `selections`:一个字典对象,键为国家名称,值为对应国家选择各答案选项的受访者占比列表。 - `options`:该调研问题的答案选项列表。 - `source`:标注为`GAS`或`WVS`,分别表示问题来源于皮尤全球态度调查(Global Attitudes Survey,GAS)或世界价值观调查(World Values Survey,WVS)。 ## 使用方法 python from datasets import load_dataset # 加载数据集 dataset = load_dataset("Anthropic/llm_global_opinions") ## 免责声明 我们认识到使用本数据集评估大语言模型存在一定局限性,因为该数据集并非专门为这一评估场景设计。因此,我们承认将本数据集应用于大语言模型评估时,其结构效度可能存在局限。 ## 联系方式 如有任何疑问,可发送邮件至 `esin@anthropic.com`。 ## 引用方式 如需引用本研究或本数据集,请使用以下BibTeX格式的引用条目: @misc{durmus2023measuring, title={Towards Measuring the Representation of Subjective Global Opinions in Language Models}, author={Esin Durmus and Karina Nyugen and Thomas I. Liao and Nicholas Schiefer and Amanda Askell and Anton Bakhtin and Carol Chen and Zac Hatfield-Dodds and Danny Hernandez and Nicholas Joseph and Liane Lovitt and Sam McCandlish and Orowa Sikder and Alex Tamkin and Janel Thamkul and Jared Kaplan and Jack Clark and Deep Ganguli}, year={2023}, eprint={2306.16388}, archivePrefix={arXiv}, primaryClass={cs.CL} }
提供机构:
maas
创建时间:
2025-02-12
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集名为GlobalOpinionQA,基于世界价值观调查和皮尤全球态度调查,包含全球议题的问卷问题及其各国回答百分比数据,用于分析大语言模型在主观全球议题上的观点和潜在偏见。数据以CSV格式提供,包括问题、选择、选项和来源等列,适用于AI系统评估和研究。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作