juletxara/xnli_mt
Source: Hugging Face. Updated 2023-07-21; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/juletxara/xnli_mt
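Resource summary:
This dataset covers cross-lingual natural language inference (XNLI). It provides multiple configurations, each named for a model and its size (for example, nllb-200-distilled-600M or bloom-7b1). Every configuration has the same three features: a premise, a hypothesis, and a label that is one of entailment, neutral, or contradiction. The data is split by language, and each language split has its own recorded byte count and number of examples.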
Provider:
juletxara
Summary of original information
Dataset overview
Dataset name
- pretty_name: Cross-lingual Natural Language Inference
Dataset configurations and features
Config: nllb-200-distilled-600M
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: nllb-200-distilled-1.3B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: nllb-200-1.3B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: nllb-200-3.3B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: xglm-564M
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: xglm-1.7B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: xglm-2.9B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: xglm-4.5B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: xglm-7.5B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: bloom-560m
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: bloom-1b1
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: bloom-1b7
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: bloom-3b
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: bloom-7b1
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: llama-7B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: llama-13B
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: RedPajama-INCITE-Base-3B-v1
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
Config: RedPajama-INCITE-7B-Base
- Features:
  - premise: string
  - hypothesis: string
  - label:
    - dtype: class_label
    - names:
      - 0: entailment
      - 1: neutral
      - 2: contradiction
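Since every configuration uses the same label scheme, a small sketch may help when decoding records. The mapping below is taken from the listing; the helper names (`int2str`, `str2int`, `load_config`) and the example record are hypothetical, and the loader assumes the `datasets` package is installed and that the repository can be loaded by configuration name in your environment.

```python
# Label ids and names as listed for every configuration.
LABEL_NAMES = ["entailment", "neutral", "contradiction"]


def int2str(label_id: int) -> str:
    """Map a class id (0, 1, 2) to its label name."""
    if not 0 <= label_id < len(LABEL_NAMES):
        raise ValueError(f"unknown label id: {label_id}")
    return LABEL_NAMES[label_id]


def str2int(label_name: str) -> int:
    """Map a label name back to its class id."""
    return LABEL_NAMES.index(label_name)


def load_config(config_name: str = "nllb-200-distilled-600M"):
    """Hypothetical helper: load one configuration from the Hugging Face Hub.

    Assumption: `datasets` is installed and the Hub is reachable.
    """
    from datasets import load_dataset  # lazy import so the rest runs offline

    return load_dataset("juletxara/xnli_mt", config_name)


# A made-up record with the three listed features (not taken from the data).
example = {
    "premise": "A man is playing a guitar on stage.",
    "hypothesis": "A person is performing music.",
    "label": 0,
}
print(int2str(example["label"]))
```

Keeping the loader separate from the label helpers means the mapping can be used on already-exported records without touching the network.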
Dataset size and download size (bytes)
- nllb-200-distilled-600M:
  - download_size: 11040341
  - dataset_size: 11881671
- nllb-200-distilled-1.3B:
  - download_size: 11043528
  - dataset_size: 11884858
- nllb-200-1.3B:
  - download_size: 11082057
  - dataset_size: 11923387
- nllb-200-3.3B:
  - download_size: 11148008
  - dataset_size: 11989338
- xglm-564M:
  - download_size: 11533534
  - dataset_size: 12374864
- xglm-1.7B:
  - download_size: 10871776
  - dataset_size: 11713106
- xglm-2.9B:
  - download_size: 10586622
  - dataset_size: 11427952
- xglm-4.5B:
  - download_size: 10968672
  - dataset_size: 11810002
- xglm-7.5B:
  - download_size: 10699999
  - dataset_size: 11541329
- bloom-560m:
  - download_size: 13312268
  - dataset_size: 14162991
- bloom-1b1:
  - download_size: 10548239
  - dataset_size: 11389569
- bloom-1b7:
  - download_size: 10580096
  - dataset_size: 11421426
- bloom-3b:
  - download_size: 10727323
  - dataset_size: 11568653
- bloom-7b1:
  - download_size: 10776918
  - dataset_size: 11618248
- llama-7B:
  - download_size: 10731053
  - dataset_size: 11572383
- llama-13B:
  - download_size: 10726595
  - dataset_size: 11567925
- RedPajama-INCITE-Base-3B-v1:
  - download_size: 11004105
  - dataset_size: 11845435
- RedPajama-INCITE-7B-Base:
  - download_size: 11004105
  - dataset_size: 11845435
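The raw figures are easier to compare after conversion. The sketch below copies the numbers from the listing, assumes they are byte counts (as is usual for Hugging Face dataset metadata), and writes the XGLM configuration names in lowercase as `xglm-*`; the names `SIZES` and `to_mib` are illustrative only.

```python
# config name -> (download_size, dataset_size), copied from the listing.
# Assumption: both values are byte counts.
SIZES = {
    "nllb-200-distilled-600M": (11040341, 11881671),
    "nllb-200-distilled-1.3B": (11043528, 11884858),
    "nllb-200-1.3B": (11082057, 11923387),
    "nllb-200-3.3B": (11148008, 11989338),
    "xglm-564M": (11533534, 12374864),
    "xglm-1.7B": (10871776, 11713106),
    "xglm-2.9B": (10586622, 11427952),
    "xglm-4.5B": (10968672, 11810002),
    "xglm-7.5B": (10699999, 11541329),
    "bloom-560m": (13312268, 14162991),
    "bloom-1b1": (10548239, 11389569),
    "bloom-1b7": (10580096, 11421426),
    "bloom-3b": (10727323, 11568653),
    "bloom-7b1": (10776918, 11618248),
    "llama-7B": (10731053, 11572383),
    "llama-13B": (10726595, 11567925),
    "RedPajama-INCITE-Base-3B-v1": (11004105, 11845435),
    "RedPajama-INCITE-7B-Base": (11004105, 11845435),
}


def to_mib(num_bytes: int) -> float:
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return num_bytes / (1024 * 1024)


# Configurations with the largest and smallest dataset_size.
largest = max(SIZES, key=lambda name: SIZES[name][1])
smallest = min(SIZES, key=lambda name: SIZES[name][1])

for name, (download, dataset) in SIZES.items():
    print(f"{name}: download {to_mib(download):.2f} MiB, "
          f"dataset {to_mib(dataset):.2f} MiB")
print("largest:", largest)
print("smallest:", smallest)
```

Every configuration comes out at roughly 11 to 14 MiB, so downloading a single configuration is cheap.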
Dataset language
- language: en



