AntoineBlanot/mnli-contrast

Name: AntoineBlanot/mnli-contrast
Creator: AntoineBlanot
Published: 2023-12-04 08:06:48
License: No description provided

Source: Hugging Face. Updated 2023-12-04. Indexed 2024-03-04.

Download link:

https://hf-mirror.com/datasets/AntoineBlanot/mnli-contrast

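Overview:

This dataset is based on the mnli-3way dataset and adds an instruction feature. For each example in the original dataset, two new data points are generated according to its class (entailment, contradiction, or neutral): one is assigned the positive instruction and label_name, the other the negative instruction and label_name. As a result, this dataset is twice the size of the original.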

Provider:

AntoineBlanot

Summary of original information

Dataset card "mnli-contrast"

Dataset information

Features

premise: string
hypothesis: string
instruction: string
label_name: string

Splits

train:
- Bytes: 254483428
- Examples: 785404
test:
- Bytes: 6297986
- Examples: 19630

Size

Download size: 54354034 bytes
Dataset size: 260781414 bytes
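
The figures above can be cross-checked with a little arithmetic. The following is a minimal sketch that only reuses the numbers listed on this page; the variable names are illustrative, and the "implied original" counts are simply the listed counts halved, on the assumption that every original example yields exactly two rows.

```python
# Split statistics copied from the listing above.
splits = {
    "train": {"num_bytes": 254483428, "num_examples": 785404},
    "test": {"num_bytes": 6297986, "num_examples": 19630},
}
download_size = 54354034
dataset_size = 260781414

# The dataset size should equal the sum of the per-split byte counts.
total_bytes = sum(s["num_bytes"] for s in splits.values())
sizes_agree = total_bytes == dataset_size

# Assumption: each original example produces two rows, so halving the
# listed counts gives the implied size of the source splits.
implied_original = {name: s["num_examples"] // 2 for name, s in splits.items()}

# Fraction of the in-memory dataset size that has to be downloaded.
download_ratio = download_size / dataset_size

print(sizes_agree)        # True
print(implied_original)   # {'train': 392702, 'test': 9815}
print(f"{download_ratio:.2f}")
```

The download is roughly one fifth of the dataset size, which is plausible for compressed files holding text in which each premise, hypothesis, and instruction string is repeated.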

Configurations

default:
- Training data file path: data/train-*
- Test data file path: data/test-*
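
As a rough illustration of how the default configuration might be consumed, the sketch below wraps `load_dataset` from the Hugging Face `datasets` library. The helper name `load_mnli_contrast` and the constants are hypothetical, introduced here for illustration; the repository ID, split names, and column names are taken from this page. It assumes `datasets` is installed and that the Hub is reachable.

```python
REPO_ID = "AntoineBlanot/mnli-contrast"
SPLITS = ("train", "test")
EXPECTED_COLUMNS = ["premise", "hypothesis", "instruction", "label_name"]


def load_mnli_contrast(split="train"):
    """Load one split of the dataset (hypothetical convenience helper)."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so the module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

Calling `load_mnli_contrast("test")` would download the smaller split; comparing the returned dataset's `column_names` against `EXPECTED_COLUMNS` is a quick way to confirm that the schema matches the feature list above.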

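Dataset description

This dataset extends the mnli-3way dataset with a new instruction feature. Together with its associated label_name, this feature describes the relationship between the premise and hypothesis features in the original dataset.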

Mapping rules

If the original example belongs to the entailment class:
- Positive example (label_name == "positive"): assigned the instruction "The meaning of the hypothesis is logically inferred from the meaning of the premise."
- Negative example (label_name == "negative"): assigned the instruction "The meaning of the hypothesis either contradicts the meaning of the premise, is unrelated to it, or does not provide sufficient information to infer the meaning of the premise."
If the original example belongs to the contradiction or neutral class:
- Positive example (label_name == "positive"): assigned the instruction "The meaning of the hypothesis either contradicts the meaning of the premise, is unrelated to it, or does not provide sufficient information to infer the meaning of the premise."
- Negative example (label_name == "negative"): assigned the instruction "The meaning of the hypothesis is logically inferred from the meaning of the premise."

This dataset is twice the size of the original because every example is paired with both a positive and a negative instruction.
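
The mapping rules can be sketched in a few lines of Python. This is an illustration of the rules as described on this page, not the creator's actual generation script; the function name `make_contrast_pair` and the constant names are hypothetical, while the two instruction strings are copied from the rules above.

```python
INFERRED = (
    "The meaning of the hypothesis is logically inferred from the meaning "
    "of the premise."
)
NOT_INFERRED = (
    "The meaning of the hypothesis either contradicts the meaning of the "
    "premise, is unrelated to it, or does not provide sufficient information "
    "to infer the meaning of the premise."
)


def make_contrast_pair(premise, hypothesis, label):
    """Return the positive and negative rows for one original example."""
    if label == "entailment":
        positive, negative = INFERRED, NOT_INFERRED
    elif label in ("contradiction", "neutral"):
        positive, negative = NOT_INFERRED, INFERRED
    else:
        raise ValueError(f"unexpected label: {label!r}")
    base = {"premise": premise, "hypothesis": hypothesis}
    return [
        {**base, "instruction": positive, "label_name": "positive"},
        {**base, "instruction": negative, "label_name": "negative"},
    ]


# Made-up example sentences, used only to exercise the function.
rows = make_contrast_pair("A man is sleeping.", "A person is asleep.", "entailment")
print(len(rows))  # 2
```

Because every call returns exactly two rows, applying the function to each original example doubles the number of rows, which matches the size relationship stated above.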
