CoIR-Retrieval/CodeSearchNet-ccr-go-queries-corpus
Source: Hugging Face. Updated 2024-06-25; indexed 2024-06-29.
Download link:
https://hf-mirror.com/datasets/CoIR-Retrieval/CodeSearchNet-ccr-go-queries-corpus
Resource description:
---
dataset_info:
  features:
  - name: _id
    dtype: string
  - name: title
    dtype: string
  - name: partition
    dtype: string
  - name: text
    dtype: string
  - name: language
    dtype: string
  - name: meta_information
    struct:
    - name: resource
      dtype: string
  splits:
  - name: queries
    num_bytes: 52487127
    num_examples: 182735
  - name: corpus
    num_bytes: 40674054
    num_examples: 182735
  download_size: 40750954
  dataset_size: 93161181
configs:
- config_name: default
  data_files:
  - split: queries
    path: data/queries-*
  - split: corpus
    path: data/corpus-*
---
# Dataset Card for "CodeSearchNet-ccr-go-queries-corpus"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Dataset information:
Features:
- Field: _id, data type: string
- Field: title, data type: string
- Field: partition, data type: string
- Field: text, data type: string
- Field: language, data type: string
- Field: meta_information, a struct containing the subfield:
  - Subfield: resource, data type: string
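
As a minimal sketch of the schema in the feature list above, the following Python check confirms that a record carries the expected fields with string values. The `validate_record` helper and the sample record are hypothetical illustrations; they are not part of the dataset or of any library.

```python
# Expected top-level string fields, taken from the feature list above.
STRING_FIELDS = ("_id", "title", "partition", "text", "language")


def validate_record(record: dict) -> list:
    """Return a list of schema problems; an empty list means the record fits."""
    problems = []
    for name in STRING_FIELDS:
        if not isinstance(record.get(name), str):
            problems.append(f"{name}: expected string")
    # meta_information is a struct with a single string subfield, resource.
    meta = record.get("meta_information")
    if not isinstance(meta, dict) or not isinstance(meta.get("resource"), str):
        problems.append("meta_information.resource: expected string")
    return problems


# Hypothetical record with made-up values, used only to exercise the check.
sample = {
    "_id": "q0",
    "title": "",
    "partition": "train",
    "text": "func Add(a, b int) int { return a + b }",
    "language": "go",
    "meta_information": {"resource": ""},
}
print(validate_record(sample))
```
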
Dataset splits:
- Split: queries, size in bytes: 52487127, number of examples: 182735
- Split: corpus, size in bytes: 40674054, number of examples: 182735
Total download size: 40750954 bytes; total dataset size: 93161181 bytes
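
The size figures above are internally consistent, which a short Python check can confirm. This is only arithmetic on the numbers reported in the card; the variable names are illustrative.

```python
# Figures copied from the split listing above.
split_bytes = {"queries": 52487127, "corpus": 40674054}
split_examples = {"queries": 182735, "corpus": 182735}
download_size = 40750954
dataset_size = 93161181

# The total dataset size should equal the sum of the per-split byte counts.
total = sum(split_bytes.values())
print(total == dataset_size)

# Average bytes per example in each split.
avg_bytes = {name: split_bytes[name] / split_examples[name] for name in split_bytes}
for name, value in avg_bytes.items():
    print(f"{name}: {value:.1f} bytes per example")

# Ratio of the download size to the total dataset size.
download_ratio = download_size / dataset_size
print(f"download / dataset size: {download_ratio:.2f}")
```
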
Configurations:
- Config name: default, with data files:
  - Split queries, data path data/queries-*
  - Split corpus, data path data/corpus-*
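
A minimal sketch of loading either split with the Hugging Face `datasets` library, assuming the library is installed and the repository is reachable on the Hub. The repository ID is inferred from the download URL above, and the `load_split` helper is a hypothetical wrapper, not part of the dataset.

```python
# Repository ID inferred from the download URL; treat it as an assumption.
REPO_ID = "CoIR-Retrieval/CodeSearchNet-ccr-go-queries-corpus"
# Split names from the default configuration.
SPLITS = ("queries", "corpus")


def load_split(split: str):
    """Load one split; requires the `datasets` package and network access."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)


# Example usage (needs network access):
# queries = load_split("queries")
# corpus = load_split("corpus")
# print(queries.num_rows, corpus.num_rows)
```

The split name is validated before the import so that a typo fails fast, without touching the network.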
Provider:
CoIR-Retrieval



