fredxlpy/LuxAlign
Source: Hugging Face. Updated: 2024-12-17. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/fredxlpy/LuxAlign
Description:
LuxAlign is a parallel dataset of Luxembourgish-English and Luxembourgish-French sentence pairs. It is designed to align the Luxembourgish embedding space with those of other languages, improving cross-lingual sentence representations for Luxembourgish. The data comes from news articles published by the Luxembourgish news platform RTL.lu. The sentence pairs are not always exact translations; they reflect high semantic similarity instead. The dataset may therefore be unsuitable for training a machine translation model unless used with caution.
Provided by:
fredxlpy



