five

FrancophonIA/Europeana_English_translations

收藏
Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/Europeana_English_translations
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了来自Europeana平台的双语元数据选择,包含从21种欧洲语言到英语的双语对。为每种语言提供了TSV格式的双语训练集和单语测试集。对于段落数少于1000的语言对,仅提供了一个包含所有段的TSV文件。这些段是从Europeana数据模型的元数据属性中提取的,并且经过了语言检测、分段和清理处理。

The dataset includes a selection of bilingual metadata from the Europeana platform, containing bilingual pairs from one of 21 European languages to English. Training sets with bilingual segments in TSV format and test sets with monolingual segments are provided for each language. For language pairs with less than 1000 segments, only one TSV file containing all segments is provided. These segments are extracted from metadata properties of the Europeana Data Model and have undergone language detection, segmentation, and cleaning processes.
提供机构:
FrancophonIA
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作