Yus287/y-github-issues
Source: Hugging Face. Updated 2023-08-03; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Yus287/y-github-issues
Resource description:
---
dataset_info:
  features:
  - name: url
    dtype: string
  - name: repository_url
    dtype: string
  - name: labels_url
    dtype: string
  - name: comments_url
    dtype: string
  - name: events_url
    dtype: string
  - name: html_url
    dtype: string
  - name: id
    dtype: int64
  - name: node_id
    dtype: string
  - name: number
    dtype: int64
  - name: title
    dtype: string
  - name: user
    struct:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: labels
    list:
    - name: color
      dtype: string
    - name: default
      dtype: bool
    - name: description
      dtype: string
    - name: id
      dtype: int64
    - name: name
      dtype: string
    - name: node_id
      dtype: string
    - name: url
      dtype: string
  - name: state
    dtype: string
  - name: locked
    dtype: bool
  - name: assignee
    struct:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: assignees
    list:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: milestone
    struct:
    - name: closed_at
      dtype: string
    - name: closed_issues
      dtype: int64
    - name: created_at
      dtype: string
    - name: creator
      struct:
      - name: avatar_url
        dtype: string
      - name: events_url
        dtype: string
      - name: followers_url
        dtype: string
      - name: following_url
        dtype: string
      - name: gists_url
        dtype: string
      - name: gravatar_id
        dtype: string
      - name: html_url
        dtype: string
      - name: id
        dtype: int64
      - name: login
        dtype: string
      - name: node_id
        dtype: string
      - name: organizations_url
        dtype: string
      - name: received_events_url
        dtype: string
      - name: repos_url
        dtype: string
      - name: site_admin
        dtype: bool
      - name: starred_url
        dtype: string
      - name: subscriptions_url
        dtype: string
      - name: type
        dtype: string
      - name: url
        dtype: string
    - name: description
      dtype: string
    - name: due_on
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: labels_url
      dtype: string
    - name: node_id
      dtype: string
    - name: number
      dtype: int64
    - name: open_issues
      dtype: int64
    - name: state
      dtype: string
    - name: title
      dtype: string
    - name: updated_at
      dtype: string
    - name: url
      dtype: string
  - name: comments
    sequence: string
  - name: created_at
    dtype: string
  - name: updated_at
    dtype: string
  - name: closed_at
    dtype: string
  - name: author_association
    dtype: string
  - name: active_lock_reason
    dtype: 'null'
  - name: draft
    dtype: bool
  - name: pull_request
    struct:
    - name: diff_url
      dtype: string
    - name: html_url
      dtype: string
    - name: merged_at
      dtype: string
    - name: patch_url
      dtype: string
    - name: url
      dtype: string
  - name: body
    dtype: string
  - name: reactions
    struct:
    - name: '+1'
      dtype: int64
    - name: '-1'
      dtype: int64
    - name: confused
      dtype: int64
    - name: eyes
      dtype: int64
    - name: heart
      dtype: int64
    - name: hooray
      dtype: int64
    - name: laugh
      dtype: int64
    - name: rocket
      dtype: int64
    - name: total_count
      dtype: int64
    - name: url
      dtype: string
  - name: timeline_url
    dtype: string
  - name: performed_via_github_app
    dtype: 'null'
  - name: state_reason
    dtype: string
  - name: is_pull_request
    dtype: bool
  splits:
  - name: train
    num_bytes: 19943411.093366094
    num_examples: 3907
  - name: test
    num_bytes: 1000488.5012285012
    num_examples: 196
  - name: val
    num_bytes: 3986640.4054054054
    num_examples: 781
  download_size: 7953657
  dataset_size: 24930540.0
---
# Dataset Card for "y-github-issues"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
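The card itself gives no usage instructions. The following is a minimal sketch of loading the data, assuming the Hugging Face `datasets` library is installed and that the repository ID `Yus287/y-github-issues` (taken from the download link) is still available on the Hub. The helper names `load_issue_splits` and `split_issues_and_prs` are hypothetical and not part of the dataset.

```python
def load_issue_splits(repo_id="Yus287/y-github-issues"):
    """Load every split (train, test, val) as a DatasetDict.

    Assumption: the `datasets` library is installed and the Hub is reachable.
    The import is local so the helper below can be used without it.
    """
    from datasets import load_dataset

    return load_dataset(repo_id)


def split_issues_and_prs(records):
    """Partition records on the boolean `is_pull_request` column."""
    issues, pull_requests = [], []
    for record in records:
        if record["is_pull_request"]:
            pull_requests.append(record)
        else:
            issues.append(record)
    return issues, pull_requests


# Example usage (requires network access, so it is left commented out):
# splits = load_issue_splits()
# issues, prs = split_issues_and_prs(splits["train"])
# print(len(issues), "issues;", len(prs), "pull requests")
```

Iterating over a split yields one dictionary per row, so the partition helper works on a loaded split and on any plain list of dictionaries that carry an `is_pull_request` key.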
Provider:
Yus287
Summary of original information
Dataset overview
Dataset name
"y-github-issues"
Dataset features
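- Basic features: url (string), repository_url (string), labels_url (string), comments_url (string), events_url (string), html_url (string), id (integer), node_id (string), number (integer), title (string).
- User: user is a struct with the sub-features avatar_url, events_url, followers_url, following_url, gists_url, gravatar_id, html_url, id, login, node_id, organizations_url, received_events_url, repos_url, site_admin, starred_url, subscriptions_url, type, url. All are strings except id (integer) and site_admin (boolean).
- Labels: labels is a list of structs with the sub-features color, default, description, id, name, node_id, url. All are strings except default (boolean) and id (integer).
- State and locking: state (string), locked (boolean).
- Assignee: assignee is a struct with the same sub-features as user.
- Assignees: assignees is a list of structs with the same sub-features as user.
- Milestone: milestone is a struct with the sub-features closed_at, closed_issues, created_at, creator (a struct with the same sub-features as user), description, due_on, html_url, id, labels_url, node_id, number, open_issues, state, title, updated_at, url. closed_issues, id, number and open_issues are integers; the remaining scalar sub-features are strings.
- Other features: comments (sequence of strings), created_at (string), updated_at (string), closed_at (string), author_association (string), active_lock_reason (null), draft (boolean), pull_request (struct with diff_url, html_url, merged_at, patch_url, url, all strings), body (string), reactions (struct with +1, -1, confused, eyes, heart, hooray, laugh, rocket, total_count, all integers, plus url, a string), timeline_url (string), performed_via_github_app (null), state_reason (string), is_pull_request (boolean).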
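Because several features are nested structs or lists, reading a record usually means walking a few levels of dictionaries. The sketch below flattens one record into a short summary. It assumes that unset structs such as assignee or milestone come back as `None`, which is how the GitHub API reports them but is not confirmed by the card. The function name `summarize_issue` is hypothetical.

```python
def summarize_issue(record):
    """Flatten one issue record into a small summary dictionary.

    Assumption: nested structs (user, assignee, milestone, reactions) and
    list-valued features (labels, comments) may be None when unset, so each
    one falls back to an empty container before it is read.
    """
    user = record.get("user") or {}
    assignee = record.get("assignee") or {}
    milestone = record.get("milestone") or {}
    reactions = record.get("reactions") or {}
    return {
        "number": record["number"],
        "title": record["title"],
        "author": user.get("login"),
        "assignee": assignee.get("login"),
        "milestone": milestone.get("title"),
        "labels": [label["name"] for label in record.get("labels") or []],
        "num_comments": len(record.get("comments") or []),
        "thumbs_up": reactions.get("+1", 0),
    }
```

The `or {}` fallbacks keep the function from raising on records where a struct is missing entirely, at the cost of reporting `None` for the corresponding summary field.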
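Dataset splits
- Training set (train): 3907 examples, 19943411.093366094 bytes
- Test set (test): 196 examples, 1000488.5012285012 bytes
- Validation set (val): 781 examples, 3986640.4054054054 bytes
Dataset size
- Download size: 7953657 bytes
- Dataset size: 24930540.0 bytes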
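The fractional byte counts look odd at first. A short calculation shows that each one equals the total dataset size multiplied by that split's share of the 4884 examples, which suggests (the card does not say so) that the per-split sizes are proportional estimates rather than measured values. The sketch below reproduces the arithmetic using only the numbers listed above; the function names are hypothetical.

```python
# Figures copied from the split and size listings above.
SPLITS = {"train": 3907, "test": 196, "val": 781}
DATASET_SIZE = 24930540   # bytes
DOWNLOAD_SIZE = 7953657   # bytes


def split_fractions(splits):
    """Return each split's share of the total number of examples."""
    total = sum(splits.values())
    return {name: count / total for name, count in splits.items()}


def proportional_bytes(splits, dataset_size):
    """Allocate dataset_size to each split in proportion to its example count."""
    total = sum(splits.values())
    return {name: dataset_size * count / total for name, count in splits.items()}


fractions = split_fractions(SPLITS)                   # roughly 80% / 4% / 16%
estimated = proportional_bytes(SPLITS, DATASET_SIZE)  # matches the listed num_bytes
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE         # roughly 0.32
```

The train, test and val shares come out at roughly 80%, 4% and 16%, and the download is roughly 32% of the in-memory dataset size.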



