mteb/CQADupstack-Webmasters-PL
收藏Hugging Face2025-05-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/mteb/CQADupstack-Webmasters-PL
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个部分:文本语料库(corpus)、默认配置(default)和查询数据(queries)。文本语料库部分包含文档的唯一标识符和文本内容。默认配置部分包含查询的唯一标识符、对应文本的唯一标识符和分数。查询数据部分包含查询的唯一标识符和查询文本。每个部分都有对应的测试集,以及测试集的字节数和示例数。
The dataset consists of three parts: text corpus (corpus), default configuration (default), and query data (queries). The text corpus part includes unique identifiers and text content of documents. The default configuration part includes unique identifiers of queries, unique identifiers of corresponding texts, and scores. The query data part includes unique identifiers of queries and the text of the queries. Each part has its corresponding test set with the number of bytes and examples.
提供机构:
mteb



