nthakur/bge-retrieval-data-7-datasets-400K-removed
Source: Hugging Face. Updated 2025-03-24; indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/nthakur/bge-retrieval-data-7-datasets-400K-removed
Resource overview:
Each record contains a query ID, the query text, positive passages, and negative passages. Every passage, positive or negative, carries a document ID, its text content, and a title. The dataset provides a training split of 343,128 examples totaling 7,194,211,487 bytes (about 7.19 GB).
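The record layout described above can be sketched in Python as follows. This is a minimal illustration, not the dataset's confirmed schema: the field names (`query_id`, `query`, `positive_passages`, `negative_passages`, `docid`, `text`, `title`) are assumptions based on the description, and the sample record is invented for demonstration. The helper `iter_triplets` is hypothetical and shows one common way such records are expanded into (query, positive, negative) training triplets.

```python
from typing import Iterator, TypedDict


class Passage(TypedDict):
    docid: str  # assumed name for the document ID field
    text: str
    title: str


class Example(TypedDict):
    query_id: str  # assumed name for the query ID field
    query: str
    positive_passages: list[Passage]
    negative_passages: list[Passage]


def format_passage(passage: Passage) -> str:
    """Join title and text; fall back to text alone when the title is empty."""
    title = passage["title"].strip()
    return f"{title}\n{passage['text']}" if title else passage["text"]


def iter_triplets(example: Example) -> Iterator[tuple[str, str, str]]:
    """Yield one (query, positive, negative) triplet per positive/negative pair."""
    for pos in example["positive_passages"]:
        for neg in example["negative_passages"]:
            yield example["query"], format_passage(pos), format_passage(neg)


# Invented sample record, for illustration only.
sample: Example = {
    "query_id": "q1",
    "query": "what is dense retrieval",
    "positive_passages": [
        {"docid": "d1", "title": "Dense retrieval", "text": "Encodes text as vectors."},
    ],
    "negative_passages": [
        {"docid": "d2", "title": "", "text": "A recipe for bread."},
        {"docid": "d3", "title": "Sparse retrieval", "text": "Uses term matching."},
    ],
}

triplets = list(iter_triplets(sample))
```

With one positive and two negatives, the sample expands into two triplets. Pairing every positive with every negative is only one possible design; sampling a fixed number of negatives per query is another.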
Provider:
nthakur



