irds/lotte_recreation_test_search
收藏Hugging Face2023-01-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/irds/lotte_recreation_test_search
下载链接
链接失效反馈官方服务:
资源简介:
`lotte/recreation/test/search`数据集由ir-datasets包提供,主要用于文本检索任务。该数据集包含924个查询(topics)和1,991个相关度评估(qrels)。文档部分需要从`irds/lotte_recreation_test`数据集中获取。数据集的使用示例展示了如何通过`load_dataset`函数加载查询和相关度评估数据。
---
展示名称:`lotte/recreation/test/search`
查看器:已禁用
源数据集:['irds/lotte_recreation_test']
任务类别:文本检索(text-retrieval)
---
# 「`lotte/recreation/test/search`」数据集卡片
本数据集`lotte/recreation/test/search`由[ir-datasets](https://ir-datasets.com/)工具包提供。如需了解该数据集的详细信息,请查阅[官方文档](https://ir-datasets.com/lotte#lotte/recreation/test/search)。
## 数据说明
本数据集包含以下内容:
- `查询集(即主题集)`:共计924条
- `相关性标注集(qrels)`:共计1991条
如需获取`文档集`,请使用 [`irds/lotte_recreation_test`](https://huggingface.co/datasets/irds/lotte_recreation_test)。
## 使用方法
python
from datasets import load_dataset
queries = load_dataset('irds/lotte_recreation_test_search', 'queries')
for record in queries:
record # {'query_id': ..., 'text': ...}
qrels = load_dataset('irds/lotte_recreation_test_search', 'qrels')
for record in qrels:
record # {'query_id': ..., 'doc_id': ..., 'relevance': ..., 'iteration': ...}
请注意,调用`load_dataset`函数将自动下载该数据集(若数据集未公开,则会提供获取权限的相关指引),并将数据转换为🤗 数据集格式生成副本。
## 引用信息
@article{Santhanam2021ColBERTv2,
title = "ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction",
author = "Keshav Santhanam and Omar Khattab and Jon Saad-Falcon and Christopher Potts and Matei Zaharia",
journal= "arXiv preprint arXiv:2112.01488",
year = "2021",
url = "https://arxiv.org/abs/2112.01488"
}
提供机构:
irds
原始信息汇总
数据集概述
数据集名称
lotte/recreation/test/search
数据来源
- 主数据集:
irds/lotte_recreation_test
数据内容
queries(查询):共924条qrels(相关性评估):共1,991条docs:使用irds/lotte_recreation_test数据集
数据使用示例
python from datasets import load_dataset
queries = load_dataset(irds/lotte_recreation_test_search, queries) for record in queries: record # {query_id: ..., text: ...}
qrels = load_dataset(irds/lotte_recreation_test_search, qrels) for record in qrels: record # {query_id: ..., doc_id: ..., relevance: ..., iteration: ...}
引用信息
@article{Santhanam2021ColBERTv2, title = "ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction", author = "Keshav Santhanam and Omar Khattab and Jon Saad-Falcon and Christopher Potts and Matei Zaharia", journal= "arXiv preprint arXiv:2112.01488", year = "2021", url = "https://arxiv.org/abs/2112.01488" }



