castorini/prebuilt-indexes-msmarco-v2.1-doc-segmented
Source: Hugging Face. Updated 2025-07-09. Indexed 2025-08-09.
Download link:
https://hf-mirror.com/datasets/castorini/prebuilt-indexes-msmarco-v2.1-doc-segmented
Description:
This dataset provides pre-built indexes for the MSMARCO-v2.1 Doc Segmented corpus, each built with a different encoder. For example, lucene-inverted.msmarco-v2.1-doc-segmented.splade-v3.20250707.4039c3.tar.gz was encoded with the SPLADE-v3 model. The dataset is currently incomplete; more pre-built indexes will be migrated to it over time.
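The example file name suggests a dotted naming scheme, so a small helper can split an archive name into its parts and build a direct download URL. The sketch below is illustrative only. The field names (index_type, encoder, date, build_id) and the reading of the trailing "4039c3" as a build identifier are assumptions, not documented by the dataset. The URL follows the usual Hugging Face "resolve" layout for dataset files, and the "main" revision is likewise assumed.

```python
import re
from urllib.parse import quote

REPO_ID = "castorini/prebuilt-indexes-msmarco-v2.1-doc-segmented"
EXAMPLE_FILE = (
    "lucene-inverted.msmarco-v2.1-doc-segmented.splade-v3.20250707.4039c3.tar.gz"
)

# Assumed pattern: <index type>.<corpus>.<encoder>.<YYYYMMDD>.<build id>.tar.gz
# The corpus name is matched literally because it contains a dot ("v2.1").
_NAME_RE = re.compile(
    r"^(?P<index_type>[^.]+)\."
    r"(?P<corpus>msmarco-v2\.1-doc-segmented)\."
    r"(?P<encoder>.+)\."
    r"(?P<date>\d{8})\."
    r"(?P<build_id>[0-9a-f]+)\.tar\.gz$"
)


def parse_index_name(filename: str) -> dict:
    """Split an index archive name into its (assumed) components."""
    match = _NAME_RE.match(filename)
    if match is None:
        raise ValueError(f"unrecognised index file name: {filename}")
    return match.groupdict()


def resolve_url(
    filename: str,
    repo_id: str = REPO_ID,
    host: str = "https://huggingface.co",
    revision: str = "main",  # assumed default branch
) -> str:
    """Build a direct download URL for a file in a Hugging Face dataset repo."""
    return f"{host}/datasets/{repo_id}/resolve/{revision}/{quote(filename)}"


if __name__ == "__main__":
    print(parse_index_name(EXAMPLE_FILE))
    print(resolve_url(EXAMPLE_FILE))
```

The host parameter is there so that a mirror can be substituted. Whether a given mirror serves the same URL layout is not confirmed here and should be checked before relying on it.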
Provider:
castorini



