nllg/wmt-metrics-data
收藏Hugging Face2023-09-04 更新2024-06-15 收录
下载链接:
https://hf-mirror.com/datasets/nllg/wmt-metrics-data
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- config_name: mt0-labse
data_files:
- split: test
path: mt0-labse/test-*
- split: train
path: mt0-labse/train-*
dataset_info:
- config_name: default
features:
- name: lp
dtype: string
- name: src
dtype: string
- name: mt
dtype: string
- name: ref
dtype: string
- name: score
dtype: float64
- name: score_type
dtype: string
splits:
- name: train
num_bytes: 718521119
num_examples: 1527567
- name: test
num_bytes: 31104504
num_examples: 77575
download_size: 0
dataset_size: 749625623
- config_name: mt0-labse
features:
- name: labels
dtype: float64
- name: input_ids
sequence: int32
- name: attention_mask
sequence: int8
- name: labse
sequence: float32
splits:
- name: test
num_bytes: 751787050.0
num_examples: 77575
- name: train
num_bytes: 14966913687.0
num_examples: 1527567
download_size: 16284469907
dataset_size: 15718700737.0
---
# Dataset Card for "wmt-metrics-data"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
nllg
原始信息汇总
数据集概述
配置信息
默认配置 (default)
- 数据文件路径:
- 训练集:
data/train-* - 测试集:
data/test-*
- 训练集:
- 特征:
lp: 字符串类型src: 字符串类型mt: 字符串类型ref: 字符串类型score: 浮点数类型score_type: 字符串类型
- 数据分割:
- 训练集: 718521119 字节, 1527567 个样本
- 测试集: 31104504 字节, 77575 个样本
- 数据集大小: 749625623 字节
mt0-labse 配置 (mt0-labse)
- 数据文件路径:
- 训练集:
mt0-labse/train-* - 测试集:
mt0-labse/test-*
- 训练集:
- 特征:
labels: 浮点数类型input_ids: 整数序列attention_mask: 整数序列labse: 浮点数序列
- 数据分割:
- 训练集: 14966913687 字节, 1527567 个样本
- 测试集: 751787050 字节, 77575 个样本
- 数据集大小: 15718700737 字节
- 下载大小: 16284469907 字节



