FrancophonIA/en_fr_pl_uk_parallel_corpus
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/en_fr_pl_uk_parallel_corpus
下载链接
链接失效反馈官方服务:
资源简介:
这是一个多语言(英语、法语、波兰语、乌克兰语)的平行语料库,基于TMX文件构成,包含了77932个翻译单元(TU)。语料库中的语言对包括英语到乌克兰语、法语到乌克兰语、波兰语到乌克兰语,分别有36227、33376、8329个翻译单元。数据集经过了合并和去除重复等处理。
This is a multilingual (English, French, Polish, Ukrainian) parallel corpus based on TMX files, containing 77,932 translation units (TUs). The corpus includes language pairs of English to Ukrainian, French to Ukrainian, and Polish to Ukrainian, with 36,227, 33,376, and 8,329 translation units respectively. The dataset has been processed with merging and deduplication.
提供机构:
FrancophonIA



