irds/codesearchnet_valid
收藏Hugging Face2023-01-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/irds/codesearchnet_valid
下载链接
链接失效反馈官方服务:
资源简介:
`codesearchnet/valid`数据集是由ir-datasets包提供的一个用于文本检索任务的数据集。该数据集包含89,154条查询(queries)和89,154条相关性评估(qrels)。文档(docs)部分需要从`irds/codesearchnet`数据集中获取。数据集的使用示例展示了如何通过`load_dataset`函数加载查询和相关性评估数据。
The `codesearchnet/valid` dataset is a text retrieval dataset provided by the ir-datasets package. It contains 89,154 queries and 89,154 relevance judgments (qrels). The document (docs) subset should be obtained from the `irds/codesearchnet` dataset. Example usages of this dataset demonstrate how to load the query and relevance judgment data via the `load_dataset` function.
提供机构:
irds
原始信息汇总
数据集概述
数据集名称
codesearchnet/valid
数据提供者
由 ir-datasets 包提供。
数据内容
queries(即主题):数量为89,154qrels(相关性评估):数量为89,154
文档数据源
使用示例
python from datasets import load_dataset
queries = load_dataset(irds/codesearchnet_valid, queries) for record in queries: record # {query_id: ..., text: ...}
qrels = load_dataset(irds/codesearchnet_valid, qrels) for record in qrels: record # {query_id: ..., doc_id: ..., relevance: ..., iteration: ...}
引用信息
@article{Husain2019CodeSearchNet, title={CodeSearchNet Challenge: Evaluating the State of Semantic Code Search}, author={Hamel Husain and Ho-Hsiang Wu and Tiferet Gazit and Miltiadis Allamanis and Marc Brockschmidt}, journal={ArXiv}, year={2019} }



