mFollowIR-parquet-mteb
收藏魔搭社区2025-09-16 更新2025-09-13 收录
下载链接:
https://modelscope.cn/datasets/jhu-clsp/mFollowIR-parquet-mteb
下载链接
链接失效反馈官方服务:
资源简介:
# mFollowIR-mteb
This is a new version of the mFollowIR dataset modified to fit the new MTEB format.
1. Restructured queries to include both original and changed versions
2. Separated instructions into a dedicated configuration
3. Reorganized qrels into default (original) and qrel_diff configurations
## Dataset Structure
The dataset contains the following configurations:
### Language: fas
- corpus-fas: Original corpus documents
- queries-fas: Queries with both original and changed versions
- instruction-fas: Instructions for both original and changed queries
- default-fas: Original relevance judgments
- qrel_diff-fas: Changes in relevance judgments
- top_ranked-fas: Top ranked documents for each query
### Language: rus
- corpus-rus: Original corpus documents
- queries-rus: Queries with both original and changed versions
- instruction-rus: Instructions for both original and changed queries
- default-rus: Original relevance judgments
- qrel_diff-rus: Changes in relevance judgments
- top_ranked-rus: Top ranked documents for each query
### Language: zho
- corpus-zho: Original corpus documents
- queries-zho: Queries with both original and changed versions
- instruction-zho: Instructions for both original and changed queries
- default-zho: Original relevance judgments
- qrel_diff-zho: Changes in relevance judgments
- top_ranked-zho: Top ranked documents for each query
# mFollowIR-mteb 数据集
本数据集为适配新版MTEB格式而修改得到的mFollowIR数据集新版本。
1. 重构查询内容,同时保留原始版本与修改后的版本
2. 将指令分离至独立配置项中
3. 将相关性判断文件(qrels)重组为默认(原始版本)与qrel_diff两种配置项
## 数据集结构
本数据集包含以下配置项:
### 语言:fas(波斯语)
- corpus-fas:原始语料库文档
- queries-fas:同时包含原始版本与修改后版本的查询
- instruction-fas:适配原始与修改后查询的指令
- default-fas:原始相关性判断结果
- qrel_diff-fas:相关性判断结果的变更项
- top_ranked-fas:各查询对应的Top排序文档
### 语言:rus(俄语)
- corpus-rus:原始语料库文档
- queries-rus:同时包含原始版本与修改后版本的查询
- instruction-rus:适配原始与修改后查询的指令
- default-rus:原始相关性判断结果
- qrel_diff-rus:相关性判断结果的变更项
- top_ranked-rus:各查询对应的Top排序文档
### 语言:zho(中文)
- corpus-zho:原始语料库文档
- queries-zho:同时包含原始版本与修改后版本的查询
- instruction-zho:适配原始与修改后查询的指令
- default-zho:原始相关性判断结果
- qrel_diff-zho:相关性判断结果的变更项
- top_ranked-zho:各查询对应的Top排序文档
提供机构:
maas
创建时间:
2025-09-10



