hgissbkh/ms_marco
Hugging Face | Updated 2025-03-14 | Indexed 2025-04-12
Download link:
https://hf-mirror.com/datasets/hgissbkh/ms_marco
Description:
The dataset consists of four parts: answers, corpus, qrels (query relevance judgments), and queries. The answers part contains query IDs and their corresponding answer sequences; the corpus part contains document IDs and document contents; the qrels part contains query IDs together with the sequences of positive and negative sample IDs related to each query; the queries part contains query IDs, query text, and query types. The data is split into training, validation, and test sets, each with its own size and number of examples.
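As a rough illustration of how the four parts relate, the sketch below joins qrels with queries and corpus to form (query, positive document, negative document) triples. It runs on toy in-memory records only. Every field name used here (`query_id`, `doc_id`, `text`, `type`, `positive_ids`, `negative_ids`) and the helper `build_triples` are hypothetical, chosen to mirror the description above; the actual column names in the dataset may differ.

```python
from typing import Dict, List, Tuple

# Toy records for illustration only. All field names are ASSUMPTIONS that
# mirror the prose description, not the dataset's confirmed schema.
queries: List[Dict] = [
    {"query_id": "q1", "text": "example query", "type": "DESCRIPTION"},
]
corpus: List[Dict] = [
    {"doc_id": "d1", "text": "a relevant passage"},
    {"doc_id": "d2", "text": "an unrelated passage"},
]
qrels: List[Dict] = [
    {"query_id": "q1", "positive_ids": ["d1"], "negative_ids": ["d2"]},
]


def build_triples(
    queries: List[Dict], corpus: List[Dict], qrels: List[Dict]
) -> List[Tuple[str, str, str]]:
    """Hypothetical helper: resolve the IDs in qrels to text triples."""
    # Index queries and documents by ID for constant-time lookup.
    query_text = {q["query_id"]: q["text"] for q in queries}
    doc_text = {d["doc_id"]: d["text"] for d in corpus}

    triples = []
    for row in qrels:
        q = query_text.get(row["query_id"])
        if q is None:
            continue  # skip judgments whose query is missing
        # Pair every positive document with every negative document.
        for pos_id in row["positive_ids"]:
            for neg_id in row["negative_ids"]:
                if pos_id in doc_text and neg_id in doc_text:
                    triples.append((q, doc_text[pos_id], doc_text[neg_id]))
    return triples


triples = build_triples(queries, corpus, qrels)
```

With real data, the same join would apply once the records are loaded (for instance with the Hugging Face `datasets` library) and the field names are adjusted to the actual schema; the configuration and column names are not stated on this page.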
Provider:
hgissbkh



