AlignmentResearch/ClearHarm

Name: AlignmentResearch/ClearHarm
Creator: AlignmentResearch
Published: 2025-05-23 18:21:40
License: 暂无描述

Hugging Face2025-05-23 更新2025-07-05 收录

下载链接：

https://hf-mirror.com/datasets/AlignmentResearch/ClearHarm

下载链接

链接失效反馈

官方服务：

资源简介：

这是一个包含文本数据的分类数据集，用于区分内容是否为良性或有害。数据集包含四个配置：default、neg、pos和rep40。每个配置都有clf_label（分类标签）、instructions（指令）、content（内容）、answer_prompt（答案提示）、proxy_clf_label（代理分类标签）、gen_target（生成目标）和proxy_gen_target（代理生成目标）等特征。clf_label有两个可能的值：良性（Benign）和有害（Harmful）。数据集分为训练集和验证集，但某些配置的验证集大小为零。数据集的总大小和下载大小在README中给出。

This is a text classification dataset for distinguishing between Benign and Harmful content. The dataset consists of four configurations: default, neg, pos, and rep40. Each configuration includes features such as clf_label (classification label), instructions, content, answer_prompt, proxy_clf_label, gen_target, and proxy_gen_target. The clf_label feature has two possible values: Benign and Harmful. The dataset is split into training and validation sets, although the validation set size for some configurations is zero. The total size and download size of the dataset are provided in the README.

提供机构：

AlignmentResearch

5,000+

优质数据集

54 个

任务类型

进入经典数据集