timpearce/alpaca-cleaned-opus-mt-en-fr
收藏Hugging Face2024-05-24 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/timpearce/alpaca-cleaned-opus-mt-en-fr
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: source_output
dtype: string
- name: source_input
dtype: string
- name: source_instruction
dtype: string
- name: output
dtype: string
- name: rouge1_output
dtype: float64
- name: rouge2_output
dtype: float64
- name: rougeL_output
dtype: float64
- name: input
dtype: string
- name: rouge1_input
dtype: float64
- name: rouge2_input
dtype: float64
- name: rougeL_input
dtype: float64
- name: instruction
dtype: string
- name: rouge1_instruction
dtype: float64
- name: rouge2_instruction
dtype: float64
- name: rougeL_instruction
dtype: float64
splits:
- name: train
num_bytes: 94282103
num_examples: 51760
download_size: 53430793
dataset_size: 94282103
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
This dataset contains multiple text features and evaluation metrics for training and evaluating the performance of text generation models. Features include source output, source input, and source instruction, along with corresponding evaluation metrics such as rouge1, rouge2, and rougeL. The dataset is divided into a training set with 51760 samples, with a download size of 53430793 bytes and a total size of 94282103 bytes.
提供机构:
timpearce
原始信息汇总
数据集概述
数据集特征
- source_output: 数据类型 - string
- source_input: 数据类型 - string
- source_instruction: 数据类型 - string
- output: 数据类型 - string
- rouge1_output: 数据类型 - float64
- rouge2_output: 数据类型 - float64
- rougeL_output: 数据类型 - float64
- input: 数据类型 - string
- rouge1_input: 数据类型 - float64
- rouge2_input: 数据类型 - float64
- rougeL_input: 数据类型 - float64
- instruction: 数据类型 - string
- rouge1_instruction: 数据类型 - float64
- rouge2_instruction: 数据类型 - float64
- rougeL_instruction: 数据类型 - float64
数据集分割
- train:
- 数据大小: 94282103 字节
- 示例数量: 51760
数据集大小
- 下载大小: 53430793 字节
- 数据集大小: 94282103 字节
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*



