ArkAiLab-Adl/Alen-Walk-safeguard-dataset-v1-mini

Name: ArkAiLab-Adl/Alen-Walk-safeguard-dataset-v1-mini
Creator: ArkAiLab-Adl
Published: 2025-12-15 04:22:46
License: No description available

Source: Hugging Face. Updated: 2025-12-15. Indexed: 2025-12-20.

Download link:

https://hf-mirror.com/datasets/ArkAiLab-Adl/Alen-Walk-safeguard-dataset-v1-mini

Description:

The **AlenWalk Safeguard Dataset v1 Mini** is an open, curated dataset focused on **detecting unsafe, abusive, sexual, and threatening language in text**. It is designed for building **AI moderation systems**, AutoMod pipelines, and content safety classifiers. Developed and curated under the **AlenWalk** project, this dataset emphasizes **numeric severity labeling** for fast, real-time moderation use cases. AlenWalk Safeguard Dataset v1 Mini provides a compact but diverse collection of text samples labeled for content safety and moderation research. This **Mini** release is intended as a lightweight entry point and foundation, offering clear severity-based numeric labels, practical moderation-oriented categories, and easy integration into rule-based and AI-driven systems. It is ideal for developers who want a **clean, transparent dataset** before scaling to larger releases.
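As a rough illustration of how numeric severity labels might plug into a rule-based moderation step, the sketch below maps a severity value to an action. The severity scale (0 to 3), the thresholds, and the action names are all hypothetical: the listing does not document the dataset's actual label values or column names, so check the dataset card before adapting this.

```python
# Hypothetical thresholds, checked from most to least severe.
# The real severity scale of the dataset is not documented in this listing.
SEVERITY_ACTIONS = [
    (3, "block"),
    (2, "flag_for_review"),
    (1, "warn"),
]


def action_for_severity(severity: int) -> str:
    """Return the moderation action for a numeric severity label."""
    for threshold, action in SEVERITY_ACTIONS:
        if severity >= threshold:
            return action
    # Anything below the lowest threshold is treated as safe.
    return "allow"


if __name__ == "__main__":
    for level in range(4):
        print(level, action_for_severity(level))
```

Keeping the thresholds in one ordered table means a classifier trained on the labels and a hand-written rule set can share the same severity-to-action policy.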

Provider:

ArkAiLab-Adl
