five

FrancophonIA/PRINCIPLE_MVEP

收藏
Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/PRINCIPLE_MVEP
下载链接
链接失效反馈
官方服务:
资源简介:
PRINCIPLE MVEP 克罗地亚-英语 法律文件平行语料库包含400个文件(4个TMX文件和198个克罗地亚语和198个英语文本文件),总计113,685个翻译单元。它主要包含克罗地亚法律的修订翻译和欧盟法院判决。其中一个TMX文件除了克罗地亚语和英语外,还包含法语翻译。文件经过清理,并对样本进行了手动内容检查。进行了自动翻译单元对齐,随后对样本进行了手动对齐检查。该语料库在PSI许可证下开放和免费提供。

The PRINCIPLE MVEP Croatian-English Parallel Corpus of legal documents contains 400 documents (4 TMX files, and 198 text files in Croatian and 198 text files in English) totaling 113,685 translation units. It contains mostly revised translations of Croatian legislation and judgements of the Court of Justice of the EU. One TMX file contains the French translation in addition to Croatian and English. Documents were cleaned, and a manual content check was performed on a sample. Automatic TU alignment was performed, followed by a manual check of alignment on a sample. It is open and freely available under the PSI licence.
提供机构:
FrancophonIA
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作