ruanchaves/assin_por_Latn_to_spa_Latn
收藏Hugging Face2023-04-22 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ruanchaves/assin_por_Latn_to_spa_Latn
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: sentence_pair_id
dtype: int64
- name: premise
dtype: string
- name: hypothesis
dtype: string
- name: relatedness_score
dtype: float32
- name: entailment_judgment
dtype:
class_label:
names:
'0': NONE
'1': ENTAILMENT
'2': PARAPHRASE
- name: __language__
dtype: string
splits:
- name: train
num_bytes: 1052463
num_examples: 5000
- name: test
num_bytes: 820108
num_examples: 4000
- name: validation
num_bytes: 210810
num_examples: 1000
download_size: 0
dataset_size: 2083381
---
# Dataset Card for "assin_por_Latn_to_spa_Latn"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
ruanchaves
原始信息汇总
数据集概述
特征信息
- sentence_pair_id: 整数类型 (int64)
- premise: 字符串类型 (string)
- hypothesis: 字符串类型 (string)
- relatedness_score: 浮点数类型 (float32)
- entailment_judgment: 分类标签类型
- 标签名称:
- 0: NONE
- 1: ENTAILMENT
- 2: PARAPHRASE
- 标签名称:
- language: 字符串类型 (string)
数据分割
- train:
- 字节数: 1052463
- 样本数: 5000
- test:
- 字节数: 820108
- 样本数: 4000
- validation:
- 字节数: 210810
- 样本数: 1000
数据集大小
- 下载大小: 0
- 数据集大小: 2083381



