mekaneeky/nllb_lug_en_vigorous_clean
收藏Hugging Face2023-03-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mekaneeky/nllb_lug_en_vigorous_clean
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: translation
dtype:
translation:
languages:
- eng_Latn
- lug_Latn
- name: laser_score
dtype: float32
- name: source_sentence_lid
dtype: float32
- name: target_sentence_lid
dtype: float32
- name: source_sentence_source
dtype: string
- name: source_sentence_url
dtype: string
- name: target_sentence_source
dtype: string
- name: target_sentence_url
dtype: string
splits:
- name: train
num_bytes: 44321608.60547476
num_examples: 94114
download_size: 21686104
dataset_size: 44321608.60547476
---
# Dataset Card for "nllb_lug_en_vigorous_clean"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
mekaneeky
原始信息汇总
数据集概述
数据集名称
- 名称: nllb_lug_en_vigorous_clean
数据集特征
- translation:
- 数据类型:
- 语言: eng_Latn, lug_Latn
- 数据类型:
- laser_score:
- 数据类型: float32
- source_sentence_lid:
- 数据类型: float32
- target_sentence_lid:
- 数据类型: float32
- source_sentence_source:
- 数据类型: string
- source_sentence_url:
- 数据类型: string
- target_sentence_source:
- 数据类型: string
- target_sentence_url:
- 数据类型: string
数据集分割
- train:
- 数据大小: 44321608.60547476 字节
- 示例数量: 94114
数据集大小
- 下载大小: 21686104 字节
- 总大小: 44321608.60547476 字节



