mteb/MIRACLRetrieval_th_top_250_only_w_correct
Source: Hugging Face. Updated 2024-09-23; indexed 2025-04-08.
Download link:
https://hf-mirror.com/datasets/mteb/MIRACLRetrieval_th_top_250_only_w_correct
Description:
The dataset has three parts: corpus, default, and queries. The corpus part has text and title fields; the default part has query-id, corpus-id, and score fields; the queries part has a text field. Each part provides a training split: corpus is approximately 118 MB with 116,956 examples, default is approximately 191 KB with 6,989 examples, and queries is approximately 98 KB with 733 examples.
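As a rough illustration of how the three parts relate, the sketch below joins toy rows shaped like the fields listed above. The `_id` field on corpus and queries rows, the sample values, the helper names `build_qrels` and `relevant_passages`, and the rule that a score above 0 means "relevant" are all assumptions made for this example; none of them is confirmed by the description.

```python
from collections import defaultdict

# Toy rows shaped like the three parts. The "_id" field and every value
# here are illustrative assumptions, not data taken from the dataset.
corpus = [
    {"_id": "d1", "title": "Title A", "text": "Passage A"},
    {"_id": "d2", "title": "Title B", "text": "Passage B"},
]
queries = [{"_id": "q1", "text": "Example question"}]
default = [
    {"query-id": "q1", "corpus-id": "d1", "score": 1},
    {"query-id": "q1", "corpus-id": "d2", "score": 0},
]


def build_qrels(rows):
    """Group judgment rows into {query-id: {corpus-id: score}}."""
    qrels = defaultdict(dict)
    for row in rows:
        qrels[row["query-id"]][row["corpus-id"]] = row["score"]
    return dict(qrels)


def relevant_passages(query_id, qrels, corpus_rows):
    """Return corpus rows judged relevant (assumed: score > 0) for a query."""
    docs = {doc["_id"]: doc for doc in corpus_rows}
    judged = qrels.get(query_id, {})
    return [docs[cid] for cid, score in judged.items() if score > 0 and cid in docs]


qrels = build_qrels(default)
for query in queries:
    hits = relevant_passages(query["_id"], qrels, corpus)
    print(query["text"], [doc["title"] for doc in hits])
```

Keeping the judgments as a nested dictionary keyed by query ID makes per-query lookups cheap, which suits the small default part (6,989 rows) next to the much larger corpus.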
Provider:
mteb



