wygao8/13B-LoRA-DP0.8-BG5
收藏Hugging Face2024-07-08 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/wygao8/13B-LoRA-DP0.8-BG5
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: cs-en
features:
- name: translation
struct:
- name: alma_cs
sequence: string
- name: alma_cs_kiwi-xl
sequence: float64
- name: alma_en
sequence: string
- name: alma_en_kiwi-xl
sequence: float64
- name: cs
dtype: string
- name: en
dtype: string
- name: language_pair
dtype: string
- name: required_directions
dtype: string
splits:
- name: train
num_bytes: 6365269
num_examples: 2009
download_size: 2029691
dataset_size: 6365269
- config_name: de-en
features:
- name: translation
struct:
- name: alma_de
sequence: string
- name: alma_de_kiwi-xl
sequence: float64
- name: alma_en
sequence: string
- name: alma_en_kiwi-xl
sequence: float64
- name: de
dtype: string
- name: en
dtype: string
- name: language_pair
dtype: string
- name: required_directions
dtype: string
splits:
- name: train
num_bytes: 6681707
num_examples: 2009
download_size: 2027552
dataset_size: 6681707
- config_name: is-en
features:
- name: translation
struct:
- name: alma_en
sequence: string
- name: alma_en_kiwi-xl
sequence: float64
- name: alma_is
sequence: string
- name: alma_is_kiwi-xl
sequence: float64
- name: en
dtype: string
- name: is
dtype: string
- name: language_pair
dtype: string
- name: required_directions
dtype: string
splits:
- name: train
num_bytes: 6455302
num_examples: 2009
download_size: 2027946
dataset_size: 6455302
- config_name: ru-en
features:
- name: translation
struct:
- name: alma_en
sequence: string
- name: alma_en_kiwi-xl
sequence: float64
- name: alma_ru
sequence: string
- name: alma_ru_kiwi-xl
sequence: float64
- name: en
dtype: string
- name: language_pair
dtype: string
- name: required_directions
dtype: string
- name: ru
dtype: string
splits:
- name: train
num_bytes: 8914985
num_examples: 2009
download_size: 2392368
dataset_size: 8914985
- config_name: zh-en
features:
- name: translation
struct:
- name: alma_en
sequence: string
- name: alma_en_kiwi-xl
sequence: float64
- name: alma_zh
sequence: string
- name: alma_zh_kiwi-xl
sequence: float64
- name: en
dtype: string
- name: language_pair
dtype: string
- name: required_directions
dtype: string
- name: zh
dtype: string
splits:
- name: train
num_bytes: 5917870
num_examples: 2009
download_size: 1971028
dataset_size: 5917870
configs:
- config_name: cs-en
data_files:
- split: train
path: cs-en/train-*
- config_name: de-en
data_files:
- split: train
path: de-en/train-*
- config_name: is-en
data_files:
- split: train
path: is-en/train-*
- config_name: ru-en
data_files:
- split: train
path: ru-en/train-*
- config_name: zh-en
data_files:
- split: train
path: zh-en/train-*
---
# Dataset Card for "13B-LoRA-DP0.8-BG5"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
wygao8
原始信息汇总
数据集概述
数据集配置
cs-en
- 特征:
translation:alma_cs: 字符串序列alma_cs_kiwi-xl: 浮点数序列alma_en: 字符串序列alma_en_kiwi-xl: 浮点数序列cs: 字符串en: 字符串language_pair: 字符串required_directions: 字符串
- 分割:
train:- 字节数: 6365269
- 样本数: 2009
- 下载大小: 2029691 字节
- 数据集大小: 6365269 字节
de-en
- 特征:
translation:alma_de: 字符串序列alma_de_kiwi-xl: 浮点数序列alma_en: 字符串序列alma_en_kiwi-xl: 浮点数序列de: 字符串en: 字符串language_pair: 字符串required_directions: 字符串
- 分割:
train:- 字节数: 6681707
- 样本数: 2009
- 下载大小: 2027552 字节
- 数据集大小: 6681707 字节
is-en
- 特征:
translation:alma_en: 字符串序列alma_en_kiwi-xl: 浮点数序列alma_is: 字符串序列alma_is_kiwi-xl: 浮点数序列en: 字符串is: 字符串language_pair: 字符串required_directions: 字符串
- 分割:
train:- 字节数: 6455302
- 样本数: 2009
- 下载大小: 2027946 字节
- 数据集大小: 6455302 字节
ru-en
- 特征:
translation:alma_en: 字符串序列alma_en_kiwi-xl: 浮点数序列alma_ru: 字符串序列alma_ru_kiwi-xl: 浮点数序列en: 字符串language_pair: 字符串required_directions: 字符串ru: 字符串
- 分割:
train:- 字节数: 8914985
- 样本数: 2009
- 下载大小: 2392368 字节
- 数据集大小: 8914985 字节
zh-en
- 特征:
translation:alma_en: 字符串序列alma_en_kiwi-xl: 浮点数序列alma_zh: 字符串序列alma_zh_kiwi-xl: 浮点数序列en: 字符串language_pair: 字符串required_directions: 字符串zh: 字符串
- 分割:
train:- 字节数: 5917870
- 样本数: 2009
- 下载大小: 1971028 字节
- 数据集大小: 5917870 字节



