
trumancai/msmarco-w-instruction-partial

Source: Hugging Face. Updated: 2024-11-10. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/trumancai/msmarco-w-instruction-partial
Description:
This monolingual (English) text retrieval dataset supports document retrieval: finding the documents in the document set that are relevant to each query. It has three configurations: default, corpus, and queries. The default configuration holds the test set, with the features query-id, corpus-id, and score (10,000 samples). The corpus configuration holds the document set, with the features _id, title, and text (322,675 samples). The queries configuration holds the query set, with the features _id and text (10,000 samples).
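As a rough illustration of how the three configurations fit together, the sketch below joins score rows to documents using the field names listed in the description. Several things here are assumptions rather than facts from the dataset card: the helper names (`load_configs`, `build_qrels`, `relevant_documents`) are hypothetical, the toy rows are invented, treating a score above 0 as "relevant" is a common retrieval convention that this dataset may not follow, and the split names inside each configuration are not stated in the source, so `load_configs` returns whatever splits the hub provides.

```python
from collections import defaultdict

REPO_ID = "trumancai/msmarco-w-instruction-partial"


def load_configs(repo_id=REPO_ID):
    """Download the three configurations (needs network access and the `datasets` package)."""
    from datasets import load_dataset  # lazy import so the helpers below work without it

    # Split names are not given in the description, so no `split=` is passed here.
    return {name: load_dataset(repo_id, name) for name in ("default", "corpus", "queries")}


def build_qrels(rows):
    """Group score rows as {query_id: {corpus_id: score}}."""
    qrels = defaultdict(dict)
    for row in rows:
        qrels[row["query-id"]][row["corpus-id"]] = row["score"]
    return dict(qrels)


def relevant_documents(query_id, qrels, corpus_by_id):
    """Return corpus records with score > 0 for one query, highest score first.

    Assumption: a positive score means "relevant".
    """
    judged = qrels.get(query_id, {})
    ranked = sorted(judged.items(), key=lambda item: item[1], reverse=True)
    return [
        corpus_by_id[doc_id]
        for doc_id, score in ranked
        if score > 0 and doc_id in corpus_by_id
    ]


# Toy rows that mimic the documented fields; the values are made up for illustration.
toy_corpus = [
    {"_id": "d1", "title": "Berlin", "text": "Berlin is the capital of Germany."},
    {"_id": "d2", "title": "Paris", "text": "Paris is the capital of France."},
]
toy_queries = [
    {"_id": "q1", "text": "capital of France"},
    {"_id": "q2", "text": "capital of Germany"},
]
toy_scores = [
    {"query-id": "q1", "corpus-id": "d2", "score": 1},
    {"query-id": "q1", "corpus-id": "d1", "score": 0},
    {"query-id": "q2", "corpus-id": "d1", "score": 1},
]

corpus_by_id = {doc["_id"]: doc for doc in toy_corpus}
qrels = build_qrels(toy_scores)

for query in toy_queries:
    titles = [doc["title"] for doc in relevant_documents(query["_id"], qrels, corpus_by_id)]
    print(query["text"], "->", titles)
```

With the real data, the same helpers could be applied to the rows returned by `load_configs()`, after checking which split each configuration actually exposes.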
Provider:
trumancai