
mFollowIR-parquet-mteb

ModelScope community, updated 2025-09-16, indexed 2025-09-13
Download link:
https://modelscope.cn/datasets/jhu-clsp/mFollowIR-parquet-mteb
Description:
# mFollowIR-mteb

This is a new version of the mFollowIR dataset modified to fit the new MTEB format.

1. Restructured queries to include both original and changed versions
2. Separated instructions into a dedicated configuration
3. Reorganized qrels into default (original) and qrel_diff configurations

## Dataset Structure

The dataset contains the following configurations:

### Language: fas

- corpus-fas: Original corpus documents
- queries-fas: Queries with both original and changed versions
- instruction-fas: Instructions for both original and changed queries
- default-fas: Original relevance judgments
- qrel_diff-fas: Changes in relevance judgments
- top_ranked-fas: Top ranked documents for each query

### Language: rus

- corpus-rus: Original corpus documents
- queries-rus: Queries with both original and changed versions
- instruction-rus: Instructions for both original and changed queries
- default-rus: Original relevance judgments
- qrel_diff-rus: Changes in relevance judgments
- top_ranked-rus: Top ranked documents for each query

### Language: zho

- corpus-zho: Original corpus documents
- queries-zho: Queries with both original and changed versions
- instruction-zho: Instructions for both original and changed queries
- default-zho: Original relevance judgments
- qrel_diff-zho: Changes in relevance judgments
- top_ranked-zho: Top ranked documents for each query
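
The configuration names above follow a regular `<kind>-<language>` pattern, so they can be generated rather than typed out. The following is a minimal sketch, not an official loader: the helper names `config_names` and `load_language` are hypothetical, and it assumes that the repository id `jhu-clsp/mFollowIR-parquet-mteb` (taken from the download link) also resolves through the Hugging Face `datasets` library, which this page does not confirm.

```python
# Languages and configuration kinds exactly as listed in the dataset card.
LANGUAGES = ("fas", "rus", "zho")
CONFIG_KINDS = (
    "corpus",
    "queries",
    "instruction",
    "default",
    "qrel_diff",
    "top_ranked",
)


def config_names(lang):
    """Return the six configuration names for one language code."""
    if lang not in LANGUAGES:
        raise ValueError(f"unknown language code: {lang!r}")
    return [f"{kind}-{lang}" for kind in CONFIG_KINDS]


def load_language(lang, repo_id="jhu-clsp/mFollowIR-parquet-mteb"):
    """Load every configuration for one language into a dict.

    Assumption: repo_id is reachable through the `datasets` library.
    The import is local so config_names() works without `datasets` installed.
    """
    from datasets import load_dataset

    return {name: load_dataset(repo_id, name) for name in config_names(lang)}


if __name__ == "__main__":
    for lang in LANGUAGES:
        print(lang, config_names(lang))
```

Calling `load_language("fas")` would attempt to download all six Persian configurations. The column names inside each configuration are not described on this page, so inspect the loaded objects before relying on a particular schema.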

Provider:
maas
Created:
2025-09-10