ZurichNLP/20min-XD
收藏Hugging Face2026-01-19 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/ZurichNLP/20min-XD
下载链接
链接失效反馈官方服务:
资源简介:
20min-XD是一个由瑞士新闻文章组成的可比语料库,包含德语和法语两种语言版本,收集自2015年至2024年间20 Minuten网站的在线版本。该语料库由15,000对语义对齐的文章组成,覆盖了从近似翻译到报道同一事件的相关文章的广泛跨语言相似度。此数据集仅限于非商业研究使用。
20min-XD is a comparable corpus of Swiss news articles in German and French, collected from the online editions of 20 Minuten between 2015 and 2024. The corpus consists of 15,000 semantically aligned article pairs, covering a wide range of cross-lingual similarity from near-translations to related articles on the same event. This dataset is intended for non-commercial research use only.
提供机构:
ZurichNLP



