tomaarsen/NanoFiQA2018-bm25
收藏Hugging Face2025-02-03 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/tomaarsen/NanoFiQA2018-bm25
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个部分:corpus(语料库)、queries(查询)和relevance(相关性)。语料库部分包含文本数据,查询部分包含查询语句,相关性部分包含查询与语料之间的相关性信息,包括正相关的语料ID和根据BM25算法排序的语料ID。数据集包含训练集,可用于训练相关模型。
The dataset consists of three parts: corpus, queries, and relevance. The corpus part contains text data, the queries part contains query statements, and the relevance part contains information about the relevance between queries and corpus, including positive corpus IDs and corpus IDs ranked by the BM25 algorithm. The dataset includes a training set, which can be used to train relevant models.
提供机构:
tomaarsen



