jayashan10/github-issues
收藏Hugging Face2024-06-06 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/jayashan10/github-issues
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: url
dtype: string
- name: repository_url
dtype: string
- name: labels_url
dtype: string
- name: comments_url
dtype: string
- name: events_url
dtype: string
- name: html_url
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: number
dtype: int64
- name: title
dtype: string
- name: user
struct:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: labels
list:
- name: id
dtype: int64
- name: node_id
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: color
dtype: string
- name: default
dtype: bool
- name: description
dtype: string
- name: state
dtype: string
- name: locked
dtype: bool
- name: assignee
struct:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: assignees
list:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: milestone
struct:
- name: url
dtype: string
- name: html_url
dtype: string
- name: labels_url
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: number
dtype: int64
- name: title
dtype: string
- name: description
dtype: string
- name: creator
struct:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: open_issues
dtype: int64
- name: closed_issues
dtype: int64
- name: state
dtype: string
- name: created_at
dtype: timestamp[s]
- name: updated_at
dtype: timestamp[s]
- name: due_on
dtype: 'null'
- name: closed_at
dtype: 'null'
- name: comments
sequence: string
- name: created_at
dtype: timestamp[s]
- name: updated_at
dtype: timestamp[s]
- name: closed_at
dtype: timestamp[s]
- name: author_association
dtype: string
- name: active_lock_reason
dtype: 'null'
- name: body
dtype: string
- name: reactions
struct:
- name: url
dtype: string
- name: total_count
dtype: int64
- name: '+1'
dtype: int64
- name: '-1'
dtype: int64
- name: laugh
dtype: int64
- name: hooray
dtype: int64
- name: confused
dtype: int64
- name: heart
dtype: int64
- name: rocket
dtype: int64
- name: eyes
dtype: int64
- name: timeline_url
dtype: string
- name: performed_via_github_app
dtype: 'null'
- name: state_reason
dtype: string
- name: draft
dtype: bool
- name: pull_request
struct:
- name: url
dtype: string
- name: html_url
dtype: string
- name: diff_url
dtype: string
- name: patch_url
dtype: string
- name: merged_at
dtype: timestamp[s]
- name: is_pull_request
dtype: bool
splits:
- name: train
num_bytes: 10632519
num_examples: 1000
download_size: 2949822
dataset_size: 10632519
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset appears to be related to GitHub issues or pull requests. It includes various features such as URLs, user information, labels, state, assignees, milestone details, comments, timestamps, reactions, and pull request details. Each feature is described with its name and data type, and some features are structured or list types containing multiple sub-features. The dataset is split into a training set with 1000 examples and includes information about the download and dataset sizes.
提供机构:
jayashan10
原始信息汇总
数据集概述
数据集特征
-
基本信息
url: 字符串类型repository_url: 字符串类型labels_url: 字符串类型comments_url: 字符串类型events_url: 字符串类型html_url: 字符串类型id: 整数类型node_id: 字符串类型number: 整数类型title: 字符串类型
-
用户信息
user: 结构体类型,包含以下字段:login: 字符串类型id: 整数类型node_id: 字符串类型avatar_url: 字符串类型gravatar_id: 字符串类型url: 字符串类型html_url: 字符串类型followers_url: 字符串类型following_url: 字符串类型gists_url: 字符串类型starred_url: 字符串类型subscriptions_url: 字符串类型organizations_url: 字符串类型repos_url: 字符串类型events_url: 字符串类型received_events_url: 字符串类型type: 字符串类型site_admin: 布尔类型
-
标签信息
labels: 列表类型,包含以下字段:id: 整数类型node_id: 字符串类型url: 字符串类型name: 字符串类型color: 字符串类型default: 布尔类型description: 字符串类型
-
状态与锁定
state: 字符串类型locked: 布尔类型
-
指派信息
assignee: 结构体类型,字段与user相同assignees: 列表类型,字段与user相同
-
里程碑信息
milestone: 结构体类型,包含以下字段:url: 字符串类型html_url: 字符串类型labels_url: 字符串类型id: 整数类型node_id: 字符串类型number: 整数类型title: 字符串类型description: 字符串类型creator: 结构体类型,字段与user相同open_issues: 整数类型closed_issues: 整数类型state: 字符串类型created_at: 时间戳类型updated_at: 时间戳类型due_on: 空值closed_at: 空值
-
其他信息
comments: 字符串序列created_at: 时间戳类型updated_at: 时间戳类型closed_at: 时间戳类型author_association: 字符串类型active_lock_reason: 空值body: 字符串类型reactions: 结构体类型,包含反应计数timeline_url: 字符串类型performed_via_github_app: 空值state_reason: 字符串类型draft: 布尔类型pull_request: 结构体类型,包含拉取请求相关信息is_pull_request: 布尔类型
数据集划分
- 训练集
train: 包含1000个示例,总大小为10632519字节。
数据集大小
- 下载大小: 2949822字节
- 数据集大小: 10632519字节



