five

Famezz/Safe_UnSafe_dataset

收藏
Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Famezz/Safe_UnSafe_dataset
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含9,035条意大利语和英语的标准化查询,标记为SAFE或UNSAFE,专门用于训练二元分类模型以检测用户输入中的毒性和侮辱性语言。数据集的语言为意大利语(it)和英语(en),任务为文本分类(二元),重点是检测有毒语言和侮辱。数据集结构包含text和label两列,其中text是预处理和标准化的用户查询,label是分类标签(SAFE或UNSAFE)。数据集的收集结合了真实世界的数据集(如BeaverTails)和合成数据,并经过了去重、去除格式伪影、标准化和长度过滤等预处理步骤。数据集适用于毒性检测和聊天机器人安全性等用途,但也存在一些局限性,如未涵盖其他安全风险如提示注入或PII泄漏。

This dataset contains 9,035 normalized queries in both Italian and English, labeled as either SAFE or UNSAFE. It is specifically designed to train binary classification models to detect toxicity and insults in user inputs. The languages are Italian (it) and English (en), and the task is text classification (binary) with a focus on detecting toxic language and insults. The dataset structure includes columns for text (preprocessed and normalized user query) and label (SAFE or UNSAFE). The dataset is a hybrid composition of real-world datasets (e.g., BeaverTails) and synthetic data, and has undergone preprocessing steps such as deduplication, artifact removal, normalization, and length filtering. It is suitable for toxicity detection and chatbot safety, but has limitations such as not covering other safety risks like prompt injection or PII leakage.
提供机构:
Famezz
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作