five

klamas/russian-toxic

收藏
Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/klamas/russian-toxic
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* dataset_info: features: - name: text dtype: string - name: label dtype: class_label: names: '0': negative '1': positive splits: - name: train num_bytes: 65009575.70216127 num_examples: 278021 - name: test num_bytes: 16252569.297838729 num_examples: 69506 download_size: 45469780 dataset_size: 81262145 task_categories: - text-classification language: - ru tags: - russian - toxic - classification size_categories: - 100K<n<1M --- # Russian toxic text datasets This datasets is a merge of - [AlexSham/Toxic_Russian_Comments](https://huggingface.co/datasets/AlexSham/Toxic_Russian_Comments) - [marriamaslova/toxic_dvach](https://huggingface.co/datasets/marriamaslova/toxic_dvach) - [textdetox/multilingual_toxicity_dataset](https://huggingface.co/datasets/textdetox/multilingual_toxicity_dataset) - Parsed toxic and non toxic texts from VK Stream API There is more than 300k toxic and non toxic comments С правильными лейблами
提供机构:
klamas
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作