nthakur/bge-retrieval-data-ivf-pruning-438K
收藏Hugging Face2025-03-10 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/nthakur/bge-retrieval-data-ivf-pruning-438K
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了查询ID、查询内容、正例段落和负例段落等信息。正例段落和负例段落均包含文档ID、文本内容和标题。数据集分为训练集,其大小为7390130389.026834字节,共有441625个示例。数据集的下载大小为4320127664字节。
The dataset includes query ID, query text, positive passages, and negative passages. Both positive and negative passages contain document ID, text, and title. The dataset is split into a training set, which is 7390130389.026834 bytes in size and contains 441625 examples. The download size of the dataset is 4320127664 bytes.
提供机构:
nthakur



