withpi/msmarco-mnrl-hard-triplets-grouped-80s-20m-joined_qwen3_embedding_tokenized_8k_5_embedding
收藏Hugging Face2025-08-08 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/withpi/msmarco-mnrl-hard-triplets-grouped-80s-20m-joined_qwen3_embedding_tokenized_8k_5_embedding
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个查询语句(query_1至query_4)、正例段落和反例段落,以及它们的哈希值,类别信息,查询数量,输入ID和注意力掩码等特征。数据集分为训练集和测试集,提供了每个分集的示例数量和文件大小信息。
The dataset includes multiple query statements (query_1 to query_4), positive and negative passages along with their hashes, category information, query count, input IDs, and attention masks. The dataset is split into training and test sets, with information provided on the number of examples and file size for each split.
提供机构:
withpi



