aarontrinh02/ms_marco_synthesis_3.1_large
收藏Hugging Face2025-04-06 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/aarontrinh02/ms_marco_synthesis_3.1_large
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个文本数据集,包含查询和指令的正反例,以及相关的文档和硬负样本文档。数据集适用于文本匹配或检索任务,其中正例表示匹配的查询和指令,反例表示不匹配的情况。硬负样本文档可能是特别挑选的与正例文档不匹配但可能具有挑战性的样本。数据集提供了一个训练集,大小约为94.3MB,共有20105个示例。
This dataset is a text dataset containing positive and negative examples of queries and instructions, along with related documents and hard negative documents. The dataset is suitable for text matching or retrieval tasks, where the positive examples represent matching queries and instructions, and the negative examples represent non-matching cases. Hard negative documents might be specifically selected samples that do not match the positive documents but are challenging. The dataset provides a training set, which is about 94.3MB in size and contains 20,105 examples.
提供机构:
aarontrinh02



