mtc/xnli_de_sub_sampled_3000_with_explanations

Hugging Face, updated 2024-02-01, indexed 2024-03-04

Download link:

https://hf-mirror.com/datasets/mtc/xnli_de_sub_sampled_3000_with_explanations

Resource overview:

---
dataset_info:
  features:
  - name: premise
    dtype: string
  - name: hypothesis
    dtype: string
  - name: label
    dtype:
      class_label:
        names:
          '0': entailment
          '1': neutral
          '2': contradiction
  - name: incoherent
    dtype: bool
  - name: explanation
    dtype: string
  splits:
  - name: train
    num_bytes: 596278
    num_examples: 1838
  - name: validation
    num_bytes: 504876
    num_examples: 2490
  - name: test
    num_bytes: 1017155
    num_examples: 5010
  download_size: 837120
  dataset_size: 2118309
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
---

The dataset includes five features: premise, hypothesis, label, incoherent, and explanation. The label feature has three classes: entailment, neutral, and contradiction. The dataset is divided into three parts: train, validation, and test, containing 1838, 2490, and 5010 samples respectively. The download size of the dataset is 837120 bytes, and the total size is 2118309 bytes.

Provider:

mtc

Summary of original information

Dataset overview

Features

premise: string.
hypothesis: string.
label: class label with the following classes:
- 0: entailment
- 1: neutral
- 2: contradiction
incoherent: boolean.
explanation: string.
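
The feature list above can be mirrored as a small typed record. The following is a minimal sketch, not part of the dataset or its tooling: the `NliExample` class and the sample record are hypothetical, and the German sentences are invented for illustration (the `xnli_de` prefix in the dataset name suggests German text, which is an assumption here).

```python
from dataclasses import dataclass

# Label names in the order given by the dataset card (integer id -> name).
LABEL_NAMES = ("entailment", "neutral", "contradiction")


@dataclass(frozen=True)
class NliExample:
    """Hypothetical container for one record with the five listed features."""

    premise: str
    hypothesis: str
    label: int        # integer class id, an index into LABEL_NAMES
    incoherent: bool
    explanation: str

    @property
    def label_name(self) -> str:
        # Translate the integer class id into its human-readable name.
        return LABEL_NAMES[self.label]


# Invented record for illustration only; it is not taken from the dataset.
example = NliExample(
    premise="Der Hund liegt auf dem Sofa.",
    hypothesis="Ein Tier liegt auf dem Sofa.",
    label=0,
    incoherent=False,
    explanation="Ein Hund ist ein Tier.",
)
print(example.label_name)
```

Keeping the label as an integer and resolving the name on demand matches how the card stores it: the class ids are the stored values and the names are metadata.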

Splits

train: 1838 examples, 596278 bytes.
validation: 2490 examples, 504876 bytes.
test: 5010 examples, 1017155 bytes.
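
To see how the examples are distributed across the splits, the counts above can be turned into shares of the total. This is a rough sketch using only the example counts from the listing; the variable names are illustrative.

```python
# Example counts per split, copied from the listing above.
NUM_EXAMPLES = {"train": 1838, "validation": 2490, "test": 5010}

total_examples = sum(NUM_EXAMPLES.values())

# Fraction of all examples that falls into each split.
shares = {name: count / total_examples for name, count in NUM_EXAMPLES.items()}

for name, share in shares.items():
    print(f"{name}: {NUM_EXAMPLES[name]} examples ({share:.1%} of {total_examples})")
```

Note that the test split is the largest of the three and the train split the smallest, so any training setup built on these splits has less data to fit than to evaluate on.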

Dataset size

Download size: 837120 bytes.
Dataset size: 2118309 bytes.
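
The reported sizes can be cross-checked against the per-split byte counts. The sketch below assumes, as is usual for Hugging Face dataset cards, that the dataset size is the sum of the split sizes and that the download size refers to the compressed files; the card itself does not state this.

```python
# Byte counts per split and the two totals, copied from the listing.
SPLIT_BYTES = {"train": 596278, "validation": 504876, "test": 1017155}
DOWNLOAD_SIZE = 837120
DATASET_SIZE = 2118309

# The split sizes should add up to the reported dataset size.
split_total = sum(SPLIT_BYTES.values())
sizes_consistent = split_total == DATASET_SIZE

# Ratio of the download size to the full dataset size.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(f"sum of splits: {split_total} bytes (consistent: {sizes_consistent})")
print(f"download / dataset size: {download_ratio:.2f}")
```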

Configuration

default:
- train: path data/train-*
- validation: path data/validation-*
- test: path data/test-*
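
The path patterns in the default configuration are glob patterns that assign data files to splits. The sketch below shows that mapping with the standard-library `fnmatch` module, plus one way a split might be loaded with the Hugging Face `datasets` library. The helper names `split_for` and `load_split` are hypothetical, the shard filename in the usage line is an invented example, and `load_split` needs the `datasets` package and network access, so it is defined but not called here.

```python
from fnmatch import fnmatch
from typing import Optional

# Repository id and path patterns, copied from the listing above.
REPO_ID = "mtc/xnli_de_sub_sampled_3000_with_explanations"
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for(path: str) -> Optional[str]:
    """Return the split whose glob pattern matches `path`, or None."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


def load_split(split: str):
    """Load one split from the Hub (requires `datasets` and network access)."""
    if split not in DATA_FILES:
        raise ValueError(f"unknown split: {split!r}")
    # Imported lazily so the rest of this sketch runs without the package.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)


# Hypothetical shard filename, used only to exercise the pattern matching.
print(split_for("data/train-00000-of-00001.parquet"))
```

With the package installed, `load_split("validation")` would be expected to return the 2490-example validation split described above.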
