GreenNode/msmarco-vn
收藏Hugging Face2026-01-08 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/GreenNode/msmarco-vn
下载链接
链接失效反馈官方服务:
资源简介:
这是一个越南语的文本检索数据集,包含corpus、default和queries三个配置。corpus配置有标题、文本等字段,default配置包含查询ID、语料库ID和分数等字段,queries配置包含文本和原始文本字段。数据集分为训练、开发和测试集,来源于mteb/msmarco数据集,并可以使用mteb库进行模型评估。
This is a Vietnamese text retrieval dataset with three configurations: corpus, default, and queries. The corpus configuration includes fields like title and text, the default configuration includes fields like query-id, corpus-id, and score, and the queries configuration includes text and og_text fields. The dataset is split into training, development, and test sets, sourced from the mteb/msmarco dataset, and can be evaluated using the mteb library.
提供机构:
GreenNode



