Generative AI Content Moderation Policies & Public Discussion Dataset

Source: Figshare · Updated 2025-06-06 · Indexed 2026-04-28

Download link:

https://figshare.com/articles/dataset/Generative_AI_Content_Moderation_Policies_Public_Discussion_Dataset/29257187

Description:

Introduction

This is a dataset of the USENIX Security '25 paper: "I Cannot Write This Because It Violates Our Content Policy": Understanding Content Moderation Policies and User Experiences in Generative AI Products.

Structure and file overview

This dataset includes two parts:

[Policy Dataset] Folder /Study1_Policy_Analysis:

    Folder /Policy_Dataset contains a collection of screenshots of content moderation policy pages in Generative AI online tools. All screenshots are in .pdf format, and are placed into three folders under this directory based on the type of pages. info.pdf contains information on the Generative AI online tools and page URLs of the screenshots.

    Folder /Coded_Policy_Segments contains analysis on the policy dataset. coded_policy.xlsx contains annotated policy segments. codebook_study1.pdf contains the codebook for data analysis.

[Reddit Dataset] Folder /Study2_Reddit_Analysis:

    Folder /Reddit_Dataset contains a collection of Reddit posts discussing content moderation in Generative AI online tools. All collected posts can be found in all_cleaned_posts.csv. In Folder ./By_Subreddits, the dataset is divided into seven .csv files by different subreddits where the posts were collected.

    codebook_study2.pdf contains the codebook for the data analysis we presented in the paper.
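As a quick sanity check on the Reddit portion described above, the per-subreddit files could be counted with a short script like the one below. This is a minimal sketch, not part of the dataset: the function name `count_posts_by_subreddit` is hypothetical, the local path in the usage section is an assumed extraction location, and the code assumes each .csv file starts with a header row (the column names are not documented here, so none are referenced).

```python
import csv
from pathlib import Path


def count_posts_by_subreddit(by_subreddits_dir):
    """Return {csv file stem: number of data rows} for each .csv in the folder.

    Assumption: every file has a header row, which is not counted.
    """
    counts = {}
    for csv_path in sorted(Path(by_subreddits_dir).glob("*.csv")):
        # newline="" lets the csv module handle line breaks inside quoted fields.
        with csv_path.open(newline="", encoding="utf-8") as f:
            reader = csv.DictReader(f)
            counts[csv_path.stem] = sum(1 for _ in reader)
    return counts


if __name__ == "__main__":
    # Hypothetical location of the extracted archive; adjust to your download.
    folder = Path("Study2_Reddit_Analysis/Reddit_Dataset/By_Subreddits")
    for name, n_posts in count_posts_by_subreddit(folder).items():
        print(f"{name}: {n_posts} posts")
```

If the per-subreddit files are a partition of all_cleaned_posts.csv, the counts should sum to the number of rows in that file, but the description does not state this explicitly, so treat a mismatch as something to investigate rather than as an error.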

Created:

2025-06-06
