seongil-dn/mteb-nq-open
收藏Hugging Face2025-02-27 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/seongil-dn/mteb-nq-open
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个部分:语料库(corpus)、查询与语料对应关系(default)和查询语句(queries)。语料库部分包含超过277万个文本示例,查询与语料对应关系部分包含约10万个查询与语料ID及分数的对应记录,查询语句部分包含约10万个查询文本。每个部分都有其特定的特征,如语料库部分包含文本内容和唯一标识符,查询与语料对应关系部分包含查询ID、语料ID和分数,查询语句部分包含查询文本和唯一标识符。
The dataset consists of three parts: a corpus (corpus), a query-corpus correspondence (default), and query statements (queries). The corpus part contains over 2.77 million text examples, the query-corpus correspondence part contains about 100,000 records of query and corpus IDs along with scores, and the query statements part contains about 100,000 query texts. Each part has its specific features, such as the corpus part includes text content and a unique identifier, the query-corpus correspondence part includes query ID, corpus ID, and score, and the query statements part includes query text and a unique identifier.
提供机构:
seongil-dn



