Generative AI Content Moderation Policies & Public Discussion Dataset

Name: Generative AI Content Moderation Policies & Public Discussion Dataset
Creator: figshare
Published: 2025-06-06 17:51:05
License: No description provided

DataCite Commons: updated 2025-06-06, indexed 2025-09-08

Download link:

https://figshare.com/articles/dataset/Generative_AI_Content_Moderation_Policies_Public_Discussion_Dataset/29257187


Description:

Introduction

This is a dataset of the USENIX Security '25 paper: "I Cannot Write This Because It Violates Our Content Policy": Understanding Content Moderation Policies and User Experiences in Generative AI Products.

Structure and file overview

This dataset includes two parts:

[Policy Dataset] Folder /Study1_Policy_Analysis:

Folder /Policy_Dataset contains a collection of screenshots of content moderation policy pages in Generative AI online tools. All screenshots are in .pdf format, and are placed into three folders under this directory based on the type of pages. info.pdf contains information on the Generative AI online tools and page URLs of the screenshots.

Folder /Coded_Policy_Segments contains analysis on the policy dataset. coded_policy.xlsx contains annotated policy segments. codebook_study1.pdf contains the codebook for data analysis.

[Reddit Dataset] Folder /Study2_Reddit_Analysis:

Folder /Reddit_Dataset contains a collection of Reddit posts discussing content moderation in Generative AI online tools. All collected posts can be found in all_cleaned_posts.csv. In Folder ./By_Subreddits, the dataset is divided into seven .csv files by different subreddits where the posts were collected.

codebook_study2.pdf contains the codebook for the data analysis we presented in the paper.
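As a rough illustration of how the per-subreddit files might be read once the archive is unpacked, the sketch below loads every .csv file in the By_Subreddits folder and counts the posts in each. It assumes that By_Subreddits sits directly under Study2_Reddit_Analysis/Reddit_Dataset and that each file has a header row. The description does not state the exact nesting or the column names, so both are assumptions, and the function names are hypothetical.

```python
import csv
from pathlib import Path


def load_posts_by_subreddit(root):
    """Read each per-subreddit CSV into a list of row dictionaries.

    `root` is the folder where the archive was unpacked. The nesting
    below is an assumption based on the dataset description.
    """
    folder = (
        Path(root)
        / "Study2_Reddit_Analysis"
        / "Reddit_Dataset"
        / "By_Subreddits"
    )
    posts = {}
    for path in sorted(folder.glob("*.csv")):
        # newline="" lets the csv module handle line endings itself.
        with path.open(newline="", encoding="utf-8") as handle:
            # DictReader takes column names from the header row,
            # so no column names need to be hard-coded here.
            posts[path.stem] = list(csv.DictReader(handle))
    return posts


def count_posts(posts):
    """Return the number of rows loaded for each file."""
    return {name: len(rows) for name, rows in posts.items()}
```

Calling count_posts(load_posts_by_subreddit("path/to/unpacked/archive")) should then give one entry per subreddit file; the totals could be compared against the row count of all_cleaned_posts.csv as a sanity check.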

Provider:

figshare

Created:

2025-06-06
