arnmig/github-issues
Hugging Face | Updated 2023-09-11 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/arnmig/github-issues
Resource description:
---
dataset_info:
  features:
  - name: url
    dtype: string
  - name: repository_url
    dtype: string
  - name: labels_url
    dtype: string
  - name: comments_url
    dtype: string
  - name: events_url
    dtype: string
  - name: html_url
    dtype: string
  - name: id
    dtype: int64
  - name: node_id
    dtype: string
  - name: number
    dtype: int64
  - name: title
    dtype: string
  - name: user
    dtype: string
  - name: labels
    dtype: string
  - name: state
    dtype: string
  - name: locked
    dtype: bool
  - name: assignee
    dtype: string
  - name: assignees
    dtype: string
  - name: milestone
    dtype: string
  - name: comments
    sequence: string
  - name: created_at
    dtype: timestamp[s]
  - name: updated_at
    dtype: timestamp[s]
  - name: closed_at
    dtype: string
  - name: author_association
    dtype: string
  - name: active_lock_reason
    dtype: string
  - name: draft
    dtype: string
  - name: pull_request
    dtype: string
  - name: body
    dtype: string
  - name: reactions
    dtype: string
  - name: timeline_url
    dtype: string
  - name: performed_via_github_app
    dtype: string
  - name: state_reason
    dtype: string
  - name: is_pull_request
    dtype: bool
  splits:
  - name: train
    num_bytes: 32536203
    num_examples: 6214
  download_size: 8102507
  dataset_size: 32536203
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
# Dataset Card for "github-issues"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provided by:
arnmig
Summary of original information
Dataset overview
Data features
- url: string
- repository_url: string
- labels_url: string
- comments_url: string
- events_url: string
- html_url: string
- id: 64-bit integer
- node_id: string
- number: 64-bit integer
- title: string
- user: string
- labels: string
- state: string
- locked: boolean
- assignee: string
- assignees: string
- milestone: string
- comments: sequence of strings
- created_at: timestamp (second resolution)
- updated_at: timestamp (second resolution)
- closed_at: string
- author_association: string
- active_lock_reason: string
- draft: string
- pull_request: string
- body: string
- reactions: string
- timeline_url: string
- performed_via_github_app: string
- state_reason: string
- is_pull_request: boolean
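
The feature list includes a boolean `is_pull_request` column alongside the issue fields, so one plausible first step is separating plain issues from pull requests. The following is a minimal sketch using only the standard library. The `IssueRow` type, the `issues_only` helper, and the sample rows are hypothetical illustrations, and only a handful of the listed columns are modelled.

```python
from typing import TypedDict


class IssueRow(TypedDict):
    """Hypothetical subset of the listed columns, with their declared types."""
    number: int            # int64
    title: str             # string
    state: str             # string
    locked: bool           # bool
    is_pull_request: bool  # bool


def issues_only(rows: list[IssueRow]) -> list[IssueRow]:
    """Keep rows that are plain issues, dropping pull requests."""
    return [row for row in rows if not row["is_pull_request"]]


# Hypothetical sample rows; the values are made up for illustration
# and are not taken from the dataset.
sample: list[IssueRow] = [
    {"number": 1, "title": "Example bug report", "state": "open",
     "locked": False, "is_pull_request": False},
    {"number": 2, "title": "Example fix", "state": "closed",
     "locked": False, "is_pull_request": True},
]

print([row["number"] for row in issues_only(sample)])
```

The same predicate could be applied to the real rows once they are loaded; the sketch only relies on the column name and type given in the list.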
Data splits
- train: 6214 examples, 32536203 bytes
Data size
- Download size: 8102507 bytes
- Dataset size: 32536203 bytes
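
To put the figures above in perspective, a short calculation derives the average size per example and the ratio between dataset size and download size. It uses only the three numbers stated in the metadata; the variable names are illustrative.

```python
# Figures copied from the dataset metadata above.
NUM_EXAMPLES = 6214
DATASET_SIZE_BYTES = 32_536_203
DOWNLOAD_SIZE_BYTES = 8_102_507

# Average number of bytes per example in the train split.
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# How many times larger the dataset is than the files that are downloaded.
size_ratio = DATASET_SIZE_BYTES / DOWNLOAD_SIZE_BYTES

print(f"{avg_bytes_per_example:.0f} bytes per example on average")
print(f"dataset size is about {size_ratio:.2f}x the download size")
```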
Configuration
- default: contains the training data files, at path data/train-*
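
With a single `default` config and a single `train` split, loading the data is likely to be a one-liner with the Hugging Face `datasets` library. The sketch below assumes that library is installed and that the repository is reachable over the network. The function name `load_train_split` is hypothetical, and the import is deferred so that defining the function has no side effects.

```python
REPO_ID = "arnmig/github-issues"


def load_train_split():
    """Load the train split of the default config (needs network access)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")


if __name__ == "__main__":
    issues = load_train_split()
    # Per the metadata above, this should report 6214 rows.
    print(issues.num_rows)
    print(issues.column_names)
```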



