lidiapierre/fr_sexism_labelled
Hugging Face. Updated 2023-10-08; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/lidiapierre/fr_sexism_labelled
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
dataset_info:
  features:
  - name: 'Unnamed: 0'
    dtype: int64
  - name: Sentences
    dtype: string
  - name: Label
    dtype: int64
  - name: fr_sentences
    dtype: string
  splits:
  - name: train
    num_bytes: 192216
    num_examples: 1137
  download_size: 119626
  dataset_size: 192216
---
# Dataset Card for "fr_sexism_labelled"
Based on the Kaggle dataset [Sexist Workplace Statements](https://www.kaggle.com/datasets/dgrosz/sexist-workplace-statements).
This dataset contains 1,137 workplace statements, roughly balanced between clearly sexist examples (labeled `1`) and ambiguous or neutral ones (labeled `0`).
The original English dataset was machine-translated into French with the [Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) model.
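
A minimal sketch of loading the data and checking the label balance described above. It assumes the Hugging Face `datasets` package is installed and that the repository is reachable under the name shown on this card. The helper names `load_train`, `label_balance`, and `to_pairs` are illustrative and not part of the dataset. Reading `fr_sentences` as the French text is an inference from the column names in the metadata.

```python
from collections import Counter

REPO_ID = "lidiapierre/fr_sexism_labelled"  # repository name taken from this card


def load_train():
    """Load the single `train` split (needs the `datasets` package and network access)."""
    from datasets import load_dataset  # lazy import so the helpers below work offline
    return load_dataset(REPO_ID, split="train")


def label_balance(rows):
    """Return the share of each `Label` value among an iterable of row dicts."""
    counts = Counter(row["Label"] for row in rows)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: n / total for label, n in sorted(counts.items())}


def to_pairs(rows):
    """Yield (text, label) pairs from `fr_sentences`, ignoring the 'Unnamed: 0' index column."""
    for row in rows:
        yield row["fr_sentences"], row["Label"]
```

Calling `label_balance(load_train())` should return shares close to one half for each label if the split is as balanced as the card states.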
Provider:
lidiapierre
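Summary of the original information
Dataset card "fr_sexism_labelled"
Dataset overview
- Source: based on the Kaggle dataset Sexist Workplace Statements.
- Content: more than 1100 examples of statements of workplace sexism, roughly balanced between cases of certain sexism and ambiguous or neutral cases (labeled "1" and "0" respectively).
- Language conversion: the original English dataset was machine-translated into French with the Helsinki-NLP/opus-mt-en-fr model.
Dataset configuration
- Default configuration:
  - Data files:
    - Split: train
    - Path: data/train-*
Dataset information
- Features:
  - Name: Unnamed: 0; data type: int64
  - Name: Sentences; data type: string
  - Name: Label; data type: int64
  - Name: fr_sentences; data type: string
- Splits:
  - Name: train
  - Size in bytes: 192216
  - Number of examples: 1137
- Download size: 119626 bytes
- Dataset size: 192216 bytes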
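
The feature list above can be turned into a simple row check. The sketch below is illustrative only: `EXPECTED_FEATURES` and `schema_problems` are hypothetical names, and mapping `int64` to Python `int` and `string` to Python `str` is an assumption about how rows appear once loaded as plain dictionaries.

```python
# Column names from the card, mapped to the Python types assumed for loaded rows.
EXPECTED_FEATURES = {
    "Unnamed: 0": int,    # int64 in the card; looks like a leftover index column
    "Sentences": str,     # string
    "Label": int,         # int64, documented values are 0 and 1
    "fr_sentences": str,  # string
}


def schema_problems(row):
    """Return a list of human-readable problems found in one row dict."""
    problems = []
    for name, expected_type in EXPECTED_FEATURES.items():
        if name not in row:
            problems.append(f"missing column: {name}")
        elif not isinstance(row[name], expected_type):
            problems.append(f"wrong type for {name}: {type(row[name]).__name__}")
    # The card documents only the labels 0 and 1.
    if isinstance(row.get("Label"), int) and row["Label"] not in (0, 1):
        problems.append(f"unexpected label: {row['Label']}")
    return problems
```

An empty list means the row matches the documented columns; anything else names the column that differs.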



