
Generative AI Content Moderation Policies & Public Discussion Dataset

Source: Figshare. Updated: 2025-06-06. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/Generative_AI_Content_Moderation_Policies_Public_Discussion_Dataset/29257187
Description:

Introduction

This is the dataset for the USENIX Security '25 paper "I Cannot Write This Because It Violates Our Content Policy": Understanding Content Moderation Policies and User Experiences in Generative AI Products.

Structure and file overview

This dataset includes two parts:

[Policy Dataset] Folder /Study1_Policy_Analysis:

Folder /Policy_Dataset contains a collection of screenshots of content moderation policy pages from Generative AI online tools. All screenshots are in .pdf format and are sorted into three folders under this directory according to page type. info.pdf lists the Generative AI online tools and the page URLs of the screenshots.

Folder /Coded_Policy_Segments contains the analysis of the policy dataset. coded_policy.xlsx contains the annotated policy segments. codebook_study1.pdf contains the codebook for the data analysis.

[Reddit Dataset] Folder /Study2_Reddit_Analysis:

Folder /Reddit_Dataset contains a collection of Reddit posts discussing content moderation in Generative AI online tools. All collected posts can be found in all_cleaned_posts.csv. In folder ./By_Subreddits, the dataset is split into seven .csv files according to the subreddit from which the posts were collected.

codebook_study2.pdf contains the codebook for the data analysis presented in the paper.
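As a quick orientation to the Reddit part of the dataset, the following is a minimal Python sketch that counts posts in all_cleaned_posts.csv and in each file under By_Subreddits. It rests on assumptions not confirmed by the description: that By_Subreddits sits next to all_cleaned_posts.csv inside Study2_Reddit_Analysis/Reddit_Dataset, that the files are UTF-8 encoded, and that each file starts with a header row. The function names are hypothetical and chosen only for illustration.

```python
import csv
from pathlib import Path

# Assumed layout (inferred from the description, not verified against the archive):
#   <root>/Study2_Reddit_Analysis/Reddit_Dataset/all_cleaned_posts.csv
#   <root>/Study2_Reddit_Analysis/Reddit_Dataset/By_Subreddits/*.csv
REDDIT_DIR = Path("Study2_Reddit_Analysis") / "Reddit_Dataset"


def count_rows(csv_path):
    """Return the number of data rows in a CSV file, assuming one header row."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        next(reader, None)  # skip the assumed header row
        # csv.reader handles quoted fields that span several lines,
        # which is common in post bodies.
        return sum(1 for _ in reader)


def summarize_reddit_dataset(root):
    """Count posts in the combined file and in each per-subreddit file."""
    base = Path(root) / REDDIT_DIR
    per_subreddit = {
        path.stem: count_rows(path)
        for path in sorted((base / "By_Subreddits").glob("*.csv"))
    }
    return {
        "all_posts": count_rows(base / "all_cleaned_posts.csv"),
        "per_subreddit": per_subreddit,
    }
```

Calling summarize_reddit_dataset with the path of the extracted archive returns the total row count and a per-file breakdown. Comparing the sum of the per-subreddit counts with the total is a reasonable sanity check, although the description does not state that the two must match exactly.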
Created:
2025-06-06