mteb/TopiOCQA_validation_top_250_only_w_correct-v2
收藏Hugging Face2025-05-04 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/mteb/TopiOCQA_validation_top_250_only_w_correct-v2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个部分:corpus、default和queries。corpus部分包含带有标题的文本数据,default部分包含查询ID、语料库ID和分数信息,queries部分包含查询文本。每个部分都有一个验证集,分别提供了字节数和示例数。corpus验证集的字节数为45379287.27612146字节,包含89933个示例;default验证集的字节数为33284.407319013524字节,包含1000个示例;queries验证集的字节数为686766.1097852029字节,也包含1000个示例。
The dataset consists of three parts: corpus, default, and queries. The corpus section contains text data with titles, the default section contains query IDs, corpus IDs, and score information, and the queries section contains query texts. Each part has a validation set, which provides the number of bytes and examples. The corpus validation set is 45379287.27612146 bytes in size and contains 89933 examples; the default validation set is 33284.407319013524 bytes in size and contains 1000 examples; the queries validation set is 686766.1097852029 bytes in size and also contains 1000 examples.
提供机构:
mteb



