Lulu19971017/wmt24pp
Hugging Face | Updated 2025-12-16 | Indexed 2025-12-20
Download link:
https://hf-mirror.com/datasets/Lulu19971017/wmt24pp
Description:
WMT24++ is a multilingual machine translation dataset containing translations from English into 55 other languages and dialects. It covers several domains, including news, social media, speech, and literature. Each record contains the source text, the post-edited target text, and the original target text, along with metadata such as the language pair identifier, the domain, and a text quality flag. The dataset is intended to support research on and evaluation of multilingual machine translation.
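To make the record layout described above concrete, here is a minimal Python sketch of one record and a filter over records. The field names (`lp`, `domain`, `source`, `target`, `original_target`, `is_bad_source`), the language pair format, and the sample sentences are assumptions chosen for illustration; this page does not list the actual column names, so check the dataset card before relying on them.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Record:
    # All field names are hypothetical, modeled on the description above.
    lp: str               # language pair identifier (format assumed)
    domain: str           # e.g. news, social, speech, literary
    source: str           # English source text
    target: str           # post-edited target text
    original_target: str  # target text before post-editing
    is_bad_source: bool   # text quality flag (assumed to mark low-quality sources)


def usable(records: List[Record], lp: str, domain: Optional[str] = None) -> List[Record]:
    """Keep records for one language pair, skipping flagged sources."""
    return [
        r for r in records
        if r.lp == lp
        and not r.is_bad_source
        and (domain is None or r.domain == domain)
    ]


# Invented sample rows, not taken from the dataset.
SAMPLE = [
    Record("en-de_DE", "news", "Hello.", "Hallo.", "Hallo!", False),
    Record("en-de_DE", "social", "lol ok", "lol ok", "lol ok", True),
    Record("en-ja_JP", "literary", "Good night.", "おやすみなさい。", "おやすみ。", False),
]

if __name__ == "__main__":
    for r in usable(SAMPLE, "en-de_DE"):
        print(r.domain, "|", r.source, "->", r.target)
```

Filtering on the quality flag before scoring is one plausible use of that metadata. Whether flagged records should be excluded depends on the evaluation setup.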
Provider:
Lulu19971017



