HusseinEid/github-issues
收藏Hugging Face2024-05-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/HusseinEid/github-issues
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: url
dtype: string
- name: repository_url
dtype: string
- name: labels_url
dtype: string
- name: comments_url
dtype: string
- name: events_url
dtype: string
- name: html_url
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: number
dtype: int64
- name: title
dtype: string
- name: user
struct:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: labels
list:
- name: id
dtype: int64
- name: node_id
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: color
dtype: string
- name: default
dtype: bool
- name: description
dtype: string
- name: state
dtype: string
- name: locked
dtype: bool
- name: assignees
list:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: comments
sequence: string
- name: created_at
dtype: int64
- name: updated_at
dtype: int64
- name: author_association
dtype: string
- name: reactions
struct:
- name: url
dtype: string
- name: total_count
dtype: int64
- name: '+1'
dtype: int64
- name: '-1'
dtype: int64
- name: laugh
dtype: int64
- name: hooray
dtype: int64
- name: confused
dtype: int64
- name: heart
dtype: int64
- name: rocket
dtype: int64
- name: eyes
dtype: int64
- name: timeline_url
dtype: string
splits:
- name: train
num_bytes: 11173497
num_examples: 6888
download_size: 1827058
dataset_size: 11173497
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
This dataset contains detailed information about GitHub projects, including project URLs, repository URLs, labels URLs, etc. Each project record includes detailed data such as user information, labels, assignees, suitable for analyzing GitHub projects and related research.
提供机构:
HusseinEid
原始信息汇总
数据集概述
数据集特征
基本特征
- url: 字符串类型
- repository_url: 字符串类型
- labels_url: 字符串类型
- comments_url: 字符串类型
- events_url: 字符串类型
- html_url: 字符串类型
- id: 整数类型
- node_id: 字符串类型
- number: 整数类型
- title: 字符串类型
用户特征
- user: 结构体类型,包含以下字段:
- login: 字符串类型
- id: 整数类型
- node_id: 字符串类型
- avatar_url: 字符串类型
- gravatar_id: 字符串类型
- url: 字符串类型
- html_url: 字符串类型
- followers_url: 字符串类型
- following_url: 字符串类型
- gists_url: 字符串类型
- starred_url: 字符串类型
- subscriptions_url: 字符串类型
- organizations_url: 字符串类型
- repos_url: 字符串类型
- events_url: 字符串类型
- received_events_url: 字符串类型
- type: 字符串类型
- site_admin: 布尔类型
标签特征
- labels: 列表类型,包含以下字段:
- id: 整数类型
- node_id: 字符串类型
- url: 字符串类型
- name: 字符串类型
- color: 字符串类型
- default: 布尔类型
- description: 字符串类型
其他特征
- state: 字符串类型
- locked: 布尔类型
- assignees: 列表类型,结构与用户特征相同
- comments: 字符串序列类型
- created_at: 整数类型
- updated_at: 整数类型
- author_association: 字符串类型
- reactions: 结构体类型,包含以下字段:
- url: 字符串类型
- total_count: 整数类型
- +1: 整数类型
- -1: 整数类型
- laugh: 整数类型
- hooray: 整数类型
- confused: 整数类型
- heart: 整数类型
- rocket: 整数类型
- eyes: 整数类型
- timeline_url: 字符串类型
数据集划分
- train: 训练集,包含6888个样本,总大小为11173497字节。
数据集大小
- 下载大小: 1827058字节
- 数据集总大小: 11173497字节



