Laissez-Faire Prompts Dataset

Indexed from NIAID Data Ecosystem on 2026-05-01
Download link:
https://doi.org/10.7910/DVN/WF8PJD
Description:
We created this dataset to study biases in language model (LM) responses to open-ended prompts that describe everyday usage, such as students working with LM-based writing assistants and screenwriters or authors using LMs to help with fiction writing. The dataset focuses on life in the United States, although we believe that many of the principles used in its construction can be adapted to other nations and societies. The instances comprising the dataset are (1) synthetic texts generated by five generative language models (ChatGPT 3.5, ChatGPT 4, Claude 2.0, Llama 2 (7B chat), and PaLM 2) in response to the open-ended prompts listed in Supplementary Tables 3, 4, and 5, and (2) co-reference labels for the gender references and names of the fictional characters in each synthetic text, extracted directly from that text. There are 500,000 instances in total, or 100K per model. Each model's 100K instances divide into 50K generated from power-neutral prompts and 50K generated from power-laden prompts, and each of those 50K subsets contains 15K Learning prompts, 15K Labor prompts, and 20K Love prompts.
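The breakdown described above can be checked with a short Python sketch. The counts and model names come from the description; the variable names and the (model, condition, domain) layout are illustrative assumptions, not the dataset's actual schema or file format.

```python
# Counts taken from the dataset description. The layout and names below are
# illustrative assumptions, not the dataset's real schema.
MODELS = ["ChatGPT 3.5", "ChatGPT 4", "Claude 2.0", "Llama 2 (7B chat)", "PaLM 2"]
POWER_CONDITIONS = ["power-neutral", "power-laden"]
DOMAIN_COUNTS = {"Learning": 15_000, "Labor": 15_000, "Love": 20_000}


def expected_counts():
    """Return the expected instance count for each (model, condition, domain) cell."""
    return {
        (model, condition, domain): n
        for model in MODELS
        for condition in POWER_CONDITIONS
        for domain, n in DOMAIN_COUNTS.items()
    }


counts = expected_counts()
per_condition = sum(DOMAIN_COUNTS.values())        # instances per power condition
per_model = per_condition * len(POWER_CONDITIONS)  # instances per model
total = sum(counts.values())                       # instances overall

print(per_condition, per_model, total)  # prints: 50000 100000 500000
```

A table like this may be handy as a sanity check after downloading: if the per-cell counts in the loaded data differ from these, the file is probably incomplete or was filtered.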
Created:
2024-04-06