FrancophonIA/NTEU_French-Bulgarian
Source: Hugging Face. Updated: 2025-03-29. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/FrancophonIA/NTEU_French-Bulgarian
Resource description:
This is a compilation of parallel corpora used to build machine translation engines in the NTEU project (Action number: 2018-EU-IA-0051). The data are compiled into two TMX files, one per tier of data source reliability. Tier A contains data from human-edited sources such as translation memories. Tier B contains data created by automatic alignment of parallel text from various web and parallel document sources. The sources include IATE Terminology, JRC-Acquis, EAC-TM, ECDC-TM, DGT-TM, Global Voices, EU-Bookshop, OPUS-EMEA, and Europarl v6.
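Since the data ship as TMX files, a short sketch of reading French-Bulgarian segment pairs from one may help. This is a minimal example under stated assumptions, not the project's own tooling: the function name `read_tmx_pairs`, the language codes `fr` and `bg`, and the inline sample document are all illustrative, and the actual file names and language tags in this dataset are not given in the description above.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="fr", tgt_lang="bg"):
    """Yield (source, target) segment pairs from a TMX file path or binary file object.

    The default language codes are assumptions; check the tags used in the real files.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "tu":
            continue
        segs = {}
        for tuv in elem.iter("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain "lang" attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Keep only the primary subtag, so "fr-FR" is treated as "fr".
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            yield segs[src_lang], segs[tgt_lang]
        elem.clear()  # free the processed translation unit to limit memory use


# Hypothetical two-language sample standing in for a real Tier A or Tier B file.
SAMPLE_TMX = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header srclang="fr" datatype="plaintext" segtype="sentence" adminlang="en"
          creationtool="example" creationtoolversion="1" o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="fr"><seg>Bonjour</seg></tuv>
      <tuv xml:lang="bg"><seg>Здравей</seg></tuv>
    </tu>
  </body>
</tmx>"""

pairs = list(read_tmx_pairs(io.BytesIO(SAMPLE_TMX.encode("utf-8"))))
print(pairs)
```

The streaming parser is used here because translation memories can be large; passing a file path instead of the in-memory sample works the same way. Keeping Tier A and Tier B in separate passes makes it possible to weight or filter the automatically aligned data differently from the human-edited data.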
Provider:
FrancophonIA



