JRC-Acquis corpus
收藏arXiv2025-09-30 收录
下载链接:
https://ec.europa.eu/jrc/en/language-technologies/jrc-acquis
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为JRC-Acquis,包含了适用于欧盟成员国的所有欧盟法律文本。该数据集由Gu等人于2018年进行了预处理,这为确保与先前BLEU分数的公平比较提供了条件。数据集包含了四个方向的翻译任务:西班牙语到英语、英语到西班牙语、德语到英语以及英语到德语的翻译任务,旨在用于神经机器翻译研究。
The dataset named JRC-Acquis encompasses all European Union (EU) legal texts applicable to EU member states. This dataset was preprocessed by Gu et al. in 2018, which enables fair comparisons with prior BLEU scores. It includes four translation directions: Spanish-to-English, English-to-Spanish, German-to-English, and English-to-German, and is intended for neural machine translation research.
提供机构:
JRC (Joint Research Centre)



