tianyang/repobench_ablation
收藏Hugging Face2023-10-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tianyang/repobench_ablation
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: cross_file_first
path: data/cross_file_first-*
- split: cross_file_random
path: data/cross_file_random-*
- split: in_file
path: data/in_file-*
dataset_info:
features:
- name: repo_name
dtype: string
- name: file_path
dtype: string
- name: context
list:
- name: identifier
dtype: string
- name: path
dtype: string
- name: snippet
dtype: string
- name: import_statement
dtype: string
- name: token_num
dtype: int64
- name: cropped_code
dtype: string
- name: all_code
dtype: string
- name: next_line
dtype: string
- name: gold_snippet_index
dtype: int64
splits:
- name: cross_file_first
num_bytes: 144850826
num_examples: 1695
- name: cross_file_random
num_bytes: 115858056
num_examples: 1549
- name: in_file
num_bytes: 126244757
num_examples: 1612
download_size: 116113239
dataset_size: 386953639
---
# Dataset Card for "repobench_ablation"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
tianyang
原始信息汇总
数据集概述
数据集配置
- 默认配置:
- 数据文件路径:
cross_file_first:data/cross_file_first-*cross_file_random:data/cross_file_random-*in_file:data/in_file-*
- 数据文件路径:
数据集信息
-
特征:
repo_name:字符串类型file_path:字符串类型context:列表类型identifier:字符串类型path:字符串类型snippet:字符串类型
import_statement:字符串类型token_num:64位整数类型cropped_code:字符串类型all_code:字符串类型next_line:字符串类型gold_snippet_index:64位整数类型
-
数据分割:
cross_file_first:- 字节数:144850826
- 样本数:1695
cross_file_random:- 字节数:115858056
- 样本数:1549
in_file:- 字节数:126244757
- 样本数:1612
-
数据集大小:
- 下载大小:116113239字节
- 数据集大小:386953639字节



