tianyang/repobench_ablation_64k
Source: Hugging Face · Updated: 2023-10-25 · Indexed: 2024-03-04
Download link:
https://hf-mirror.com/datasets/tianyang/repobench_ablation_64k
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: cross_file_first
    path: data/cross_file_first-*
  - split: cross_file_random
    path: data/cross_file_random-*
  - split: in_file
    path: data/in_file-*
dataset_info:
  features:
  - name: repo_name
    dtype: string
  - name: file_path
    dtype: string
  - name: context
    list:
    - name: identifier
      dtype: string
    - name: path
      dtype: string
    - name: snippet
      dtype: string
  - name: import_statement
    dtype: string
  - name: token_num
    dtype: int64
  - name: cropped_code
    dtype: string
  - name: all_code
    dtype: string
  - name: next_line
    dtype: string
  - name: gold_snippet_index
    dtype: int64
  splits:
  - name: cross_file_first
    num_bytes: 248891343
    num_examples: 3644
  - name: cross_file_random
    num_bytes: 201499818
    num_examples: 3321
  - name: in_file
    num_bytes: 231190696
    num_examples: 3549
  download_size: 209884822
  dataset_size: 681581857
---
# Dataset Card for "repobench_ablation_64k"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
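
Since the card itself gives no usage notes, here is a minimal sketch of loading one split with the Hugging Face `datasets` library. The helper name `load_split` and the constants are illustrative assumptions, not part of the dataset; only the repository id and the three split names come from the metadata above.

```python
# Split names and repository id as listed in the card's metadata.
SPLITS = ("cross_file_first", "cross_file_random", "in_file")
REPO_ID = "tianyang/repobench_ablation_64k"


def load_split(split: str):
    """Load one split from the Hugging Face Hub (hypothetical helper)."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so the validation above works even when the
    # `datasets` package is not installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

Calling `load_split("in_file")` needs network access and the `datasets` package, and it downloads the data files for the repository on first use.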
Provider:
tianyang
Summary of original information
Dataset overview
Dataset name
- repobench_ablation_64k
Configuration
- Default configuration
Data files
- Split: cross_file_first
  - Path: data/cross_file_first-*
- Split: cross_file_random
  - Path: data/cross_file_random-*
- Split: in_file
  - Path: data/in_file-*
Dataset information
Features
- repo_name: string
- file_path: string
- context: list
  - identifier: string
  - path: string
  - snippet: string
- import_statement: string
- token_num: integer (int64)
- cropped_code: string
- all_code: string
- next_line: string
- gold_snippet_index: integer (int64)
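
To illustrate how the listed fields fit together, the sketch below looks up the `context` entry that `gold_snippet_index` points at. The card does not document the semantics of this field, so two things are assumptions here: that it is a 0-based index into `context`, and that an out-of-range value means "no gold snippet". The record is hypothetical, with made-up values shaped like the feature list.

```python
from typing import Any, Optional


def gold_snippet(record: dict[str, Any]) -> Optional[dict[str, str]]:
    """Return the context entry selected by gold_snippet_index, or None.

    Assumption: the index is 0-based into record["context"]; any value
    outside that range is treated as "no gold snippet".
    """
    context = record["context"]
    index = record["gold_snippet_index"]
    if 0 <= index < len(context):
        return context[index]
    return None


# Hypothetical record (invented values) with the same field names as the card.
example = {
    "repo_name": "owner/project",
    "file_path": "pkg/main.py",
    "context": [
        {"identifier": "helper", "path": "pkg/util.py", "snippet": "def helper(): ..."},
        {"identifier": "Config", "path": "pkg/config.py", "snippet": "class Config: ..."},
    ],
    "import_statement": "from pkg.util import helper\nfrom pkg.config import Config",
    "token_num": 42,
    "cropped_code": "cfg = ",
    "all_code": "from pkg.config import Config\ncfg = Config()",
    "next_line": "cfg = Config()",
    "gold_snippet_index": 1,
}

snippet = gold_snippet(example)
print(snippet["path"] if snippet else "no gold snippet")
```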
Splits
- cross_file_first
  - Bytes: 248891343
  - Examples: 3644
- cross_file_random
  - Bytes: 201499818
  - Examples: 3321
- in_file
  - Bytes: 231190696
  - Examples: 3549
Size
- Download size: 209884822 bytes
- Dataset size: 681581857 bytes
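
As a quick consistency check on the figures above, the sketch below sums the per-split byte counts and example counts. The variable names are illustrative and the numbers are copied from the listing. The three byte counts add up exactly to the reported dataset size, and the splits hold 10514 examples in total.

```python
# Per-split statistics copied from the listing above.
SPLIT_STATS = {
    "cross_file_first": {"num_bytes": 248_891_343, "num_examples": 3_644},
    "cross_file_random": {"num_bytes": 201_499_818, "num_examples": 3_321},
    "in_file": {"num_bytes": 231_190_696, "num_examples": 3_549},
}
DATASET_SIZE = 681_581_857  # reported dataset size in bytes

total_bytes = sum(s["num_bytes"] for s in SPLIT_STATS.values())
total_examples = sum(s["num_examples"] for s in SPLIT_STATS.values())

# The per-split sizes should account for the whole dataset.
assert total_bytes == DATASET_SIZE

for name, stats in SPLIT_STATS.items():
    avg_bytes = stats["num_bytes"] / stats["num_examples"]
    print(f"{name}: {stats['num_examples']} examples, ~{avg_bytes:.0f} bytes each")
print(f"total: {total_examples} examples, {total_bytes} bytes")
```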



