wecover/OPUS_Europarl
收藏Hugging Face2024-01-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/wecover/OPUS_Europarl
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: '*/*/train.parquet'
- split: valid
path: '*/*/valid.parquet'
- split: test
path: '*/*/test.parquet'
- config_name: bg
data_files:
- split: train
path: '*/*bg*/train.parquet'
- split: test
path: '*/*bg*/test.parquet'
- split: valid
path: '*/*bg*/valid.parquet'
- config_name: cs
data_files:
- split: train
path: '*/*cs*/train.parquet'
- split: test
path: '*/*cs*/test.parquet'
- split: valid
path: '*/*cs*/valid.parquet'
- config_name: da
data_files:
- split: train
path: '*/*da*/train.parquet'
- split: test
path: '*/*da*/test.parquet'
- split: valid
path: '*/*da*/valid.parquet'
- config_name: de
data_files:
- split: train
path: '*/*de*/train.parquet'
- split: test
path: '*/*de*/test.parquet'
- split: valid
path: '*/*de*/valid.parquet'
- config_name: el
data_files:
- split: train
path: '*/*el*/train.parquet'
- split: test
path: '*/*el*/test.parquet'
- split: valid
path: '*/*el*/valid.parquet'
- config_name: en
data_files:
- split: train
path: '*/*en*/train.parquet'
- split: test
path: '*/*en*/test.parquet'
- split: valid
path: '*/*en*/valid.parquet'
- config_name: es
data_files:
- split: train
path: '*/*es*/train.parquet'
- split: test
path: '*/*es*/test.parquet'
- split: valid
path: '*/*es*/valid.parquet'
- config_name: et
data_files:
- split: train
path: '*/*et*/train.parquet'
- split: test
path: '*/*et*/test.parquet'
- split: valid
path: '*/*et*/valid.parquet'
- config_name: fi
data_files:
- split: train
path: '*/*fi*/train.parquet'
- split: test
path: '*/*fi*/test.parquet'
- split: valid
path: '*/*fi*/valid.parquet'
- config_name: fr
data_files:
- split: train
path: '*/*fr*/train.parquet'
- split: test
path: '*/*fr*/test.parquet'
- split: valid
path: '*/*fr*/valid.parquet'
- config_name: hu
data_files:
- split: train
path: '*/*hu*/train.parquet'
- split: test
path: '*/*hu*/test.parquet'
- split: valid
path: '*/*hu*/valid.parquet'
- config_name: it
data_files:
- split: train
path: '*/*it*/train.parquet'
- split: test
path: '*/*it*/test.parquet'
- split: valid
path: '*/*it*/valid.parquet'
- config_name: lt
data_files:
- split: train
path: '*/*lt*/train.parquet'
- split: test
path: '*/*lt*/test.parquet'
- split: valid
path: '*/*lt*/valid.parquet'
- config_name: nl
data_files:
- split: train
path: '*/*nl*/train.parquet'
- split: test
path: '*/*nl*/test.parquet'
- split: valid
path: '*/*nl*/valid.parquet'
- config_name: pl
data_files:
- split: train
path: '*/*pl*/train.parquet'
- split: test
path: '*/*pl*/test.parquet'
- split: valid
path: '*/*pl*/valid.parquet'
- config_name: pt
data_files:
- split: train
path: '*/*pt*/train.parquet'
- split: test
path: '*/*pt*/test.parquet'
- split: valid
path: '*/*pt*/valid.parquet'
- config_name: ro
data_files:
- split: train
path: '*/*ro*/train.parquet'
- split: test
path: '*/*ro*/test.parquet'
- split: valid
path: '*/*ro*/valid.parquet'
- config_name: sk
data_files:
- split: train
path: '*/*sk*/train.parquet'
- split: test
path: '*/*sk*/test.parquet'
- split: valid
path: '*/*sk*/valid.parquet'
- config_name: sl
data_files:
- split: train
path: '*/*sl*/train.parquet'
- split: test
path: '*/*sl*/test.parquet'
- split: valid
path: '*/*sl*/valid.parquet'
- config_name: sv
data_files:
- split: train
path: '*/*sv*/train.parquet'
- split: test
path: '*/*sv*/test.parquet'
- split: valid
path: '*/*sv*/valid.parquet'
---
提供机构:
wecover
原始信息汇总
数据集概述
配置信息
默认配置
- 训练集:
*/*/train.parquet - 验证集:
*/*/valid.parquet - 测试集:
*/*/test.parquet
语言特定配置
以下是按语言分类的配置及其对应的数据文件路径:
-
保加利亚语 (bg)
- 训练集:
*/*bg*/train.parquet - 验证集:
*/*bg*/valid.parquet - 测试集:
*/*bg*/test.parquet
- 训练集:
-
捷克语 (cs)
- 训练集:
*/*cs*/train.parquet - 验证集:
*/*cs*/valid.parquet - 测试集:
*/*cs*/test.parquet
- 训练集:
-
丹麦语 (da)
- 训练集:
*/*da*/train.parquet - 验证集:
*/*da*/valid.parquet - 测试集:
*/*da*/test.parquet
- 训练集:
-
德语 (de)
- 训练集:
*/*de*/train.parquet - 验证集:
*/*de*/valid.parquet - 测试集:
*/*de*/test.parquet
- 训练集:
-
希腊语 (el)
- 训练集:
*/*el*/train.parquet - 验证集:
*/*el*/valid.parquet - 测试集:
*/*el*/test.parquet
- 训练集:
-
英语 (en)
- 训练集:
*/*en*/train.parquet - 验证集:
*/*en*/valid.parquet - 测试集:
*/*en*/test.parquet
- 训练集:
-
西班牙语 (es)
- 训练集:
*/*es*/train.parquet - 验证集:
*/*es*/valid.parquet - 测试集:
*/*es*/test.parquet
- 训练集:
-
爱沙尼亚语 (et)
- 训练集:
*/*et*/train.parquet - 验证集:
*/*et*/valid.parquet - 测试集:
*/*et*/test.parquet
- 训练集:
-
芬兰语 (fi)
- 训练集:
*/*fi*/train.parquet - 验证集:
*/*fi*/valid.parquet - 测试集:
*/*fi*/test.parquet
- 训练集:
-
法语 (fr)
- 训练集:
*/*fr*/train.parquet - 验证集:
*/*fr*/valid.parquet - 测试集:
*/*fr*/test.parquet
- 训练集:
-
匈牙利语 (hu)
- 训练集:
*/*hu*/train.parquet - 验证集:
*/*hu*/valid.parquet - 测试集:
*/*hu*/test.parquet
- 训练集:
-
意大利语 (it)
- 训练集:
*/*it*/train.parquet - 验证集:
*/*it*/valid.parquet - 测试集:
*/*it*/test.parquet
- 训练集:
-
立陶宛语 (lt)
- 训练集:
*/*lt*/train.parquet - 验证集:
*/*lt*/valid.parquet - 测试集:
*/*lt*/test.parquet
- 训练集:
-
荷兰语 (nl)
- 训练集:
*/*nl*/train.parquet - 验证集:
*/*nl*/valid.parquet - 测试集:
*/*nl*/test.parquet
- 训练集:
-
波兰语 (pl)
- 训练集:
*/*pl*/train.parquet - 验证集:
*/*pl*/valid.parquet - 测试集:
*/*pl*/test.parquet
- 训练集:
-
葡萄牙语 (pt)
- 训练集:
*/*pt*/train.parquet - 验证集:
*/*pt*/valid.parquet - 测试集:
*/*pt*/test.parquet
- 训练集:
-
罗马尼亚语 (ro)
- 训练集:
*/*ro*/train.parquet - 验证集:
*/*ro*/valid.parquet - 测试集:
*/*ro*/test.parquet
- 训练集:
-
斯洛伐克语 (sk)
- 训练集:
*/*sk*/train.parquet - 验证集:
*/*sk*/valid.parquet - 测试集:
*/*sk*/test.parquet
- 训练集:
-
斯洛文尼亚语 (sl)
- 训练集:
*/*sl*/train.parquet - 验证集:
*/*sl*/valid.parquet - 测试集:
*/*sl*/test.parquet
- 训练集:
-
瑞典语 (sv)
- 训练集:
*/*sv*/train.parquet - 验证集:
*/*sv*/valid.parquet - 测试集:
*/*sv*/test.parquet
- 训练集:



