anitamaxvim/jigsaw-toxic-comments
Source: Hugging Face | Updated: 2025-04-02 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/anitamaxvim/jigsaw-toxic-comments
Description:
The Jigsaw Toxic Comments dataset is a benchmark dataset created for the Toxic Comment Classification Challenge on Kaggle. It is intended to support the development of machine learning models that identify and classify toxic online comments. Each record contains the comment text and binary labels for six categories of toxicity: Toxic, Severe toxic, Obscene, Threat, Insult, and Identity hate. The data is sourced from online comment platforms and has been manually annotated and preprocessed for quality and consistency.
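As a rough illustration of the multi-label layout described here, the sketch below models one record as comment text plus six binary flags. The label identifiers follow the column names of the original Kaggle challenge CSV; whether this repository uses the same names is an assumption, and the `Comment` class, the helper functions, and the sample record are hypothetical and not taken from the dataset.

```python
from dataclasses import dataclass, field

# Assumed label names, following the original Kaggle challenge CSV columns.
# The actual column names in this repository may differ.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


@dataclass
class Comment:
    """Hypothetical container for one record: text plus binary labels."""
    text: str
    labels: dict = field(default_factory=dict)  # label name -> 0 or 1


def to_vector(comment):
    """Return the six binary labels as a fixed-order list of 0/1 values."""
    return [int(comment.labels.get(name, 0)) for name in LABELS]


def active_labels(comment):
    """Return the names of the labels set to 1, in LABELS order."""
    return [name for name in LABELS if comment.labels.get(name, 0) == 1]


# Made-up record for illustration only; not a row from the dataset.
example = Comment(text="placeholder comment text", labels={"toxic": 1, "insult": 1})
vector = to_vector(example)
names = active_labels(example)
```

Because the labels are independent binary flags, a single comment can carry several of them at once, or none at all, which is what distinguishes this task from single-label classification.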
Provided by:
anitamaxvim



