ounstoppableo/github-issues_with_comments
Source: Hugging Face. Updated 2026-04-08. Indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/ounstoppableo/github-issues_with_comments
Resource description:
---
dataset_info:
  features:
  - name: url
    dtype: string
  - name: repository_url
    dtype: string
  - name: labels_url
    dtype: string
  - name: comments_url
    dtype: string
  - name: events_url
    dtype: string
  - name: html_url
    dtype: string
  - name: id
    dtype: int64
  - name: node_id
    dtype: string
  - name: number
    dtype: int64
  - name: title
    dtype: string
  - name: user
    struct:
    - name: login
      dtype: string
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: avatar_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: organizations_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: events_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: type
      dtype: string
    - name: user_view_type
      dtype: string
    - name: site_admin
      dtype: bool
  - name: labels
    list:
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: url
      dtype: string
    - name: name
      dtype: string
    - name: color
      dtype: string
    - name: default
      dtype: bool
    - name: description
      dtype: string
  - name: state
    dtype: string
  - name: locked
    dtype: bool
  - name: assignees
    list:
    - name: login
      dtype: string
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: avatar_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: organizations_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: events_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: type
      dtype: string
    - name: user_view_type
      dtype: string
    - name: site_admin
      dtype: bool
  - name: milestone
    struct:
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: labels_url
      dtype: string
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: number
      dtype: int64
    - name: title
      dtype: string
    - name: description
      dtype: string
    - name: creator
      struct:
      - name: login
        dtype: string
      - name: id
        dtype: int64
      - name: node_id
        dtype: string
      - name: avatar_url
        dtype: string
      - name: gravatar_id
        dtype: string
      - name: url
        dtype: string
      - name: html_url
        dtype: string
      - name: followers_url
        dtype: string
      - name: following_url
        dtype: string
      - name: gists_url
        dtype: string
      - name: starred_url
        dtype: string
      - name: subscriptions_url
        dtype: string
      - name: organizations_url
        dtype: string
      - name: repos_url
        dtype: string
      - name: events_url
        dtype: string
      - name: received_events_url
        dtype: string
      - name: type
        dtype: string
      - name: user_view_type
        dtype: string
      - name: site_admin
        dtype: bool
    - name: open_issues
      dtype: int64
    - name: closed_issues
      dtype: int64
    - name: state
      dtype: string
    - name: created_at
      dtype: timestamp[s]
    - name: updated_at
      dtype: timestamp[s]
    - name: due_on
      dtype: 'null'
    - name: closed_at
      dtype: 'null'
  - name: created_at
    dtype: timestamp[s]
  - name: updated_at
    dtype: timestamp[s]
  - name: closed_at
    dtype: timestamp[s]
  - name: assignee
    struct:
    - name: login
      dtype: string
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: avatar_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: organizations_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: events_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: type
      dtype: string
    - name: user_view_type
      dtype: string
    - name: site_admin
      dtype: bool
  - name: author_association
    dtype: string
  - name: type
    dtype: 'null'
  - name: active_lock_reason
    dtype: 'null'
  - name: draft
    dtype: bool
  - name: pull_request
    struct:
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: diff_url
      dtype: string
    - name: patch_url
      dtype: string
    - name: merged_at
      dtype: timestamp[s]
  - name: body
    dtype: string
  - name: closed_by
    struct:
    - name: login
      dtype: string
    - name: id
      dtype: int64
    - name: node_id
      dtype: string
    - name: avatar_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: url
      dtype: string
    - name: html_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: organizations_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: events_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: type
      dtype: string
    - name: user_view_type
      dtype: string
    - name: site_admin
      dtype: bool
  - name: reactions
    struct:
    - name: url
      dtype: string
    - name: total_count
      dtype: int64
    - name: '+1'
      dtype: int64
    - name: '-1'
      dtype: int64
    - name: laugh
      dtype: int64
    - name: hooray
      dtype: int64
    - name: confused
      dtype: int64
    - name: heart
      dtype: int64
    - name: rocket
      dtype: int64
    - name: eyes
      dtype: int64
  - name: timeline_url
    dtype: string
  - name: performed_via_github_app
    dtype: 'null'
  - name: state_reason
    dtype: string
  - name: sub_issues_summary
    struct:
    - name: total
      dtype: int64
    - name: completed
      dtype: int64
    - name: percent_completed
      dtype: int64
  - name: issue_dependencies_summary
    struct:
    - name: blocked_by
      dtype: int64
    - name: total_blocked_by
      dtype: int64
    - name: blocking
      dtype: int64
    - name: total_blocking
      dtype: int64
  - name: pinned_comment
    dtype: 'null'
  - name: comments
    list: string
  splits:
  - name: train
    num_bytes: 14705364
    num_examples: 2324
  download_size: 10200903
  dataset_size: 14705364
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
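The metadata above is enough to load the data with the Hugging Face `datasets` library. The following is a minimal sketch, not something taken from the dataset card itself: the helper name `load_issues` is hypothetical, and it assumes the repository is publicly reachable and that `datasets` is installed.

```python
# Repository id as shown in the page title and download link.
REPO_ID = "ounstoppableo/github-issues_with_comments"


def load_issues(split: str = "train"):
    """Download (or reuse the local cache of) one split of the dataset.

    `load_issues` is a hypothetical helper name used only for this sketch.
    """
    # Imported lazily so the module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)


# Example usage (needs network access on the first call):
#   issues = load_issues()
#   print(issues.num_rows)
#   print(issues.features["comments"])
```

The only split declared in the metadata is `train`, so that is the default here.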
Dataset information:
Features:
- Field name: URL, data type: string
- Field name: Repository URL, data type: string
- Field name: Labels URL, data type: string
- Field name: Comments URL, data type: string
- Field name: Events URL, data type: string
- Field name: HTML URL, data type: string
- Field name: ID, data type: 64-bit integer
- Field name: Node ID, data type: string
- Field name: Number, data type: 64-bit integer
- Field name: Title, data type: string
- Field name: User, struct type with the following subfields:
  - Field name: Login, data type: string
  - Field name: ID, data type: 64-bit integer
  - Field name: Node ID, data type: string
  - Field name: Avatar URL, data type: string
  - Field name: Gravatar ID, data type: string
  - Field name: URL, data type: string
  - Field name: HTML URL, data type: string
  - Field name: Followers URL, data type: string
  - Field name: Following URL, data type: string
  - Field name: Gists URL, data type: string
  - Field name: Starred URL, data type: string
  - Field name: Subscriptions URL, data type: string
  - Field name: Organizations URL, data type: string
  - Field name: Repos URL, data type: string
  - Field name: Events URL, data type: string
  - Field name: Received Events URL, data type: string
  - Field name: Type, data type: string
  - Field name: User View Type, data type: string
  - Field name: Site Admin, data type: boolean
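- Field name: Labels, list type whose elements are structs with the following subfields:
  - Field name: ID, data type: 64-bit integer
  - Field name: Node ID, data type: string
  - Field name: URL, data type: string
  - Field name: Name, data type: string
  - Field name: Color, data type: string
  - Field name: Default, data type: boolean
  - Field name: Description, data type: string
- Field name: State (of the issue), data type: string
- Field name: Locked, data type: boolean
- Field name: Assignees, list type whose elements have the same fields as the User struct above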
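- Field name: Milestone, struct type with the following subfields:
  - Field name: URL, data type: string
  - Field name: HTML URL, data type: string
  - Field name: Labels URL, data type: string
  - Field name: ID, data type: 64-bit integer
  - Field name: Node ID, data type: string
  - Field name: Number, data type: 64-bit integer
  - Field name: Title, data type: string
  - Field name: Description, data type: string
  - Field name: Creator, struct type with the same fields as the User struct above
  - Field name: Open Issues, data type: 64-bit integer
  - Field name: Closed Issues, data type: 64-bit integer
  - Field name: State, data type: string
  - Field name: Created At, data type: timestamp (second precision)
  - Field name: Updated At, data type: timestamp (second precision)
  - Field name: Due On, data type: null
  - Field name: Closed At, data type: null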
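- Field name: Created At (of the issue), data type: timestamp (second precision)
- Field name: Updated At (of the issue), data type: timestamp (second precision)
- Field name: Closed At (of the issue), data type: timestamp (second precision)
- Field name: Assignee, struct type with the same fields as the User struct above
- Field name: Author Association, data type: string
- Field name: Type, data type: null
- Field name: Active Lock Reason, data type: null
- Field name: Draft, data type: boolean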
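- Field name: Pull Request, struct type with the following subfields:
  - Field name: URL, data type: string
  - Field name: HTML URL, data type: string
  - Field name: Diff URL, data type: string
  - Field name: Patch URL, data type: string
  - Field name: Merged At, data type: timestamp (second precision)
- Field name: Body (issue text), data type: string
- Field name: Closed By, struct type with the same fields as the User struct above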
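- Field name: Reactions, struct type with the following subfields:
  - Field name: URL, data type: string
  - Field name: Total Count, data type: 64-bit integer
  - Field name: Thumbs-up count (+1), data type: 64-bit integer
  - Field name: Thumbs-down count (-1), data type: 64-bit integer
  - Field name: Laugh, data type: 64-bit integer
  - Field name: Hooray, data type: 64-bit integer
  - Field name: Confused, data type: 64-bit integer
  - Field name: Heart, data type: 64-bit integer
  - Field name: Rocket, data type: 64-bit integer
  - Field name: Eyes, data type: 64-bit integer
- Field name: Timeline URL, data type: string
- Field name: Performed Via GitHub App, data type: null
- Field name: State Reason, data type: string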
- Field name: Sub Issues Summary, struct type with the following subfields:
  - Field name: Total, data type: 64-bit integer
  - Field name: Completed, data type: 64-bit integer
  - Field name: Percent Completed, data type: 64-bit integer
- Field name: Issue Dependencies Summary, struct type with the following subfields:
  - Field name: Blocked By, data type: 64-bit integer
  - Field name: Total Blocked By, data type: 64-bit integer
  - Field name: Blocking, data type: 64-bit integer
  - Field name: Total Blocking, data type: 64-bit integer
- Field name: Pinned Comment, data type: null
- Field name: Comments, data type: list of strings
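To show how the nested fields listed above might be used, here is a small sketch that works on plain dictionaries shaped like one record. It assumes, as is common for data collected from the GitHub issues endpoint, that `pull_request` is null for ordinary issues and a struct for pull requests; the dataset card does not state this. The helper names and the two sample records are hypothetical, made up for illustration.

```python
from typing import Any, Mapping


def is_pull_request(record: Mapping[str, Any]) -> bool:
    """True when the record carries a non-null `pull_request` struct (assumed convention)."""
    return record.get("pull_request") is not None


def comment_count(record: Mapping[str, Any]) -> int:
    """Number of entries in the `comments` list; a missing or null list counts as 0."""
    return len(record.get("comments") or [])


# Hypothetical records containing only the fields this sketch reads.
sample_issue = {
    "number": 1,
    "title": "Example bug report",
    "pull_request": None,
    "comments": ["first comment", "second comment"],
}
sample_pr = {
    "number": 2,
    "title": "Example fix",
    "pull_request": {"url": "https://example.invalid/pulls/2", "merged_at": None},
    "comments": [],
}

for record in (sample_issue, sample_pr):
    kind = "pull request" if is_pull_request(record) else "issue"
    print(f"#{record['number']} is a {kind} with {comment_count(record)} comment(s)")
```

The same two helpers can be passed to a filter or map step once the real records are loaded, since each row is exposed as a dictionary with these keys.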
Dataset splits:
- Split name: Train, size in bytes: 14705364, number of examples: 2324
Download size: 10200903, dataset size: 14705364
Configurations:
- Configuration name: Default, data files:
  - Split: Train, file path: data/train-*
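The size figures above allow a quick back-of-the-envelope check. The sketch below only does arithmetic on the three published numbers. Treating the download size as the stored files and the dataset size as the loaded data is an assumption about how these fields are usually defined, not something the page states.

```python
# Figures copied from the split information above.
NUM_BYTES = 14_705_364       # dataset size of the train split
NUM_EXAMPLES = 2_324         # number of examples in the train split
DOWNLOAD_SIZE = 10_200_903   # size of the files to download

# Average size of one record, in bytes.
avg_bytes_per_example = NUM_BYTES / NUM_EXAMPLES

# Fraction of the dataset size that actually has to be downloaded.
download_ratio = DOWNLOAD_SIZE / NUM_BYTES

print(f"average record size: {avg_bytes_per_example:.0f} bytes")
print(f"download size relative to dataset size: {download_ratio:.1%}")
```

On these numbers an average record is a little over 6 KB, and the download is roughly 69% of the dataset size.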
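The `data/train-*` entry is a glob pattern: every file in the repository whose path matches it belongs to the train split. The sketch below approximates that matching with Python's standard `fnmatch` module. The file names are hypothetical examples, and the Hub's own pattern resolution may differ in detail.

```python
from fnmatch import fnmatch

# Pattern copied from the configuration above.
TRAIN_PATTERN = "data/train-*"


def files_for_train(paths):
    """Return the paths that match the train split pattern, in their original order."""
    return [path for path in paths if fnmatch(path, TRAIN_PATTERN)]


# Hypothetical repository listing, for illustration only.
candidate_paths = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

print(files_for_train(candidate_paths))
```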
Provider:
ounstoppableo



