Partha117/oss_bugs_with_chunks
收藏Hugging Face2024-03-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Partha117/oss_bugs_with_chunks
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: status
dtype: string
- name: repo_name
dtype: string
- name: repo_url
dtype: string
- name: issue_id
dtype: int64
- name: title
dtype: string
- name: body
dtype: string
- name: issue_url
dtype: string
- name: pull_url
dtype: string
- name: before_fix_sha
dtype: string
- name: after_fix_sha
dtype: string
- name: report_datetime
dtype: timestamp[us, tz=UTC]
- name: language
dtype: string
- name: commit_datetime
dtype: timestamp[us, tz=UTC]
- name: updated_file
dtype: string
- name: chunk_content
dtype: string
splits:
- name: train
num_bytes: 4556954676
num_examples: 1167967
download_size: 346622731
dataset_size: 4556954676
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
Partha117
原始信息汇总
数据集概述
数据集信息
特征
- status: 类型为字符串
- repo_name: 类型为字符串
- repo_url: 类型为字符串
- issue_id: 类型为整数
- title: 类型为字符串
- body: 类型为字符串
- issue_url: 类型为字符串
- pull_url: 类型为字符串
- before_fix_sha: 类型为字符串
- after_fix_sha: 类型为字符串
- report_datetime: 类型为时间戳(微秒,UTC时区)
- language: 类型为字符串
- commit_datetime: 类型为时间戳(微秒,UTC时区)
- updated_file: 类型为字符串
- chunk_content: 类型为字符串
数据分割
- train: 包含1167967个样本,总字节数为4556954676
数据集大小
- 下载大小: 346622731字节
- 数据集大小: 4556954676字节
配置
- default:
- 数据文件:
- split: train
- path: data/train-*
- 数据文件:



