Yus287/y-github-issues

Source: Hugging Face. Updated: 2023-08-03. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Yus287/y-github-issues
Resource description:
---
dataset_info:
  features:
  - name: url
    dtype: string
  - name: repository_url
    dtype: string
  - name: labels_url
    dtype: string
  - name: comments_url
    dtype: string
  - name: events_url
    dtype: string
  - name: html_url
    dtype: string
  - name: id
    dtype: int64
  - name: node_id
    dtype: string
  - name: number
    dtype: int64
  - name: title
    dtype: string
  - name: user
    struct:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: labels
    list:
    - name: color
      dtype: string
    - name: default
      dtype: bool
    - name: description
      dtype: string
    - name: id
      dtype: int64
    - name: name
      dtype: string
    - name: node_id
      dtype: string
    - name: url
      dtype: string
  - name: state
    dtype: string
  - name: locked
    dtype: bool
  - name: assignee
    struct:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: assignees
    list:
    - name: avatar_url
      dtype: string
    - name: events_url
      dtype: string
    - name: followers_url
      dtype: string
    - name: following_url
      dtype: string
    - name: gists_url
      dtype: string
    - name: gravatar_id
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: login
      dtype: string
    - name: node_id
      dtype: string
    - name: organizations_url
      dtype: string
    - name: received_events_url
      dtype: string
    - name: repos_url
      dtype: string
    - name: site_admin
      dtype: bool
    - name: starred_url
      dtype: string
    - name: subscriptions_url
      dtype: string
    - name: type
      dtype: string
    - name: url
      dtype: string
  - name: milestone
    struct:
    - name: closed_at
      dtype: string
    - name: closed_issues
      dtype: int64
    - name: created_at
      dtype: string
    - name: creator
      struct:
      - name: avatar_url
        dtype: string
      - name: events_url
        dtype: string
      - name: followers_url
        dtype: string
      - name: following_url
        dtype: string
      - name: gists_url
        dtype: string
      - name: gravatar_id
        dtype: string
      - name: html_url
        dtype: string
      - name: id
        dtype: int64
      - name: login
        dtype: string
      - name: node_id
        dtype: string
      - name: organizations_url
        dtype: string
      - name: received_events_url
        dtype: string
      - name: repos_url
        dtype: string
      - name: site_admin
        dtype: bool
      - name: starred_url
        dtype: string
      - name: subscriptions_url
        dtype: string
      - name: type
        dtype: string
      - name: url
        dtype: string
    - name: description
      dtype: string
    - name: due_on
      dtype: string
    - name: html_url
      dtype: string
    - name: id
      dtype: int64
    - name: labels_url
      dtype: string
    - name: node_id
      dtype: string
    - name: number
      dtype: int64
    - name: open_issues
      dtype: int64
    - name: state
      dtype: string
    - name: title
      dtype: string
    - name: updated_at
      dtype: string
    - name: url
      dtype: string
  - name: comments
    sequence: string
  - name: created_at
    dtype: string
  - name: updated_at
    dtype: string
  - name: closed_at
    dtype: string
  - name: author_association
    dtype: string
  - name: active_lock_reason
    dtype: 'null'
  - name: draft
    dtype: bool
  - name: pull_request
    struct:
    - name: diff_url
      dtype: string
    - name: html_url
      dtype: string
    - name: merged_at
      dtype: string
    - name: patch_url
      dtype: string
    - name: url
      dtype: string
  - name: body
    dtype: string
  - name: reactions
    struct:
    - name: '+1'
      dtype: int64
    - name: '-1'
      dtype: int64
    - name: confused
      dtype: int64
    - name: eyes
      dtype: int64
    - name: heart
      dtype: int64
    - name: hooray
      dtype: int64
    - name: laugh
      dtype: int64
    - name: rocket
      dtype: int64
    - name: total_count
      dtype: int64
    - name: url
      dtype: string
  - name: timeline_url
    dtype: string
  - name: performed_via_github_app
    dtype: 'null'
  - name: state_reason
    dtype: string
  - name: is_pull_request
    dtype: bool
  splits:
  - name: train
    num_bytes: 19943411.093366094
    num_examples: 3907
  - name: test
    num_bytes: 1000488.5012285012
    num_examples: 196
  - name: val
    num_bytes: 3986640.4054054054
    num_examples: 781
  download_size: 7953657
  dataset_size: 24930540.0
---
# Dataset Card for "y-github-issues"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provider:
Yus287
Summary of original information

Dataset overview

Dataset name

"y-github-issues"

Dataset features

  • Basic features:

    • url: string
    • repository_url: string
    • labels_url: string
    • comments_url: string
    • events_url: string
    • html_url: string
    • id: integer (int64)
    • node_id: string
    • number: integer (int64)
    • title: string
  • User features:

    • user: struct with the subfields avatar_url, events_url, followers_url, following_url, gists_url, gravatar_id, html_url, id, login, node_id, organizations_url, received_events_url, repos_url, site_admin, starred_url, subscriptions_url, type, url. All are strings except id (int64) and site_admin (bool).
  • Label features:

    • labels: list of structs with the subfields color, default, description, id, name, node_id, url. All are strings except default (bool) and id (int64).
  • State and locking:

    • state: string
    • locked: bool
  • Assignee features:

    • assignee: struct with the same subfields as user.
  • Assignee list:

    • assignees: list of structs, each with the same subfields as user.
  • Milestone features:

    • milestone: struct with the subfields closed_at, closed_issues, created_at, creator (a struct with the same subfields as user), description, due_on, html_url, id, labels_url, node_id, number, open_issues, state, title, updated_at, url. closed_issues, id, number, and open_issues are int64; the remaining scalar subfields are strings.
  • Other features:

    • comments: sequence of strings
    • created_at: string
    • updated_at: string
    • closed_at: string
    • author_association: string
    • active_lock_reason: null
    • draft: bool
    • pull_request: struct with the subfields diff_url, html_url, merged_at, patch_url, url, all strings.
    • body: string
    • reactions: struct with the subfields +1, -1, confused, eyes, heart, hooray, laugh, rocket, total_count, url. All are int64 except url (string).
    • timeline_url: string
    • performed_via_github_app: null
    • state_reason: string
    • is_pull_request: bool
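Because the schema carries both pull_request and is_pull_request, a common first step is to separate plain issues from pull requests. The following is a minimal sketch of that filter using only the standard library. The two records and the helper name is_plain_issue are hypothetical and invented for illustration; only the field names come from the schema above.

```python
from typing import Any

# Hypothetical records: the field names follow the schema above,
# but every value here is invented for illustration.
records: list[dict[str, Any]] = [
    {
        "number": 1,
        "title": "Example bug report",
        "state": "open",
        "is_pull_request": False,
        "pull_request": None,
        "comments": ["First comment"],
    },
    {
        "number": 2,
        "title": "Example patch",
        "state": "closed",
        "is_pull_request": True,
        "pull_request": {
            "diff_url": "",
            "html_url": "",
            "merged_at": None,
            "patch_url": "",
            "url": "",
        },
        "comments": [],
    },
]


def is_plain_issue(record: dict[str, Any]) -> bool:
    """Return True for issues and False for pull requests."""
    return not record["is_pull_request"]


# Keep only the records that are not pull requests.
issues_only = [r for r in records if is_plain_issue(r)]
issue_numbers = [r["number"] for r in issues_only]
print(issue_numbers)
```

If the data is loaded with the Hugging Face datasets library, the same predicate could presumably be passed to Dataset.filter, since each row is exposed as a dictionary keyed by feature name.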

Dataset splits

  • Training set (train): 3907 examples, 19943411.093366094 bytes
  • Test set (test): 196 examples, 1000488.5012285012 bytes
  • Validation set (val): 781 examples, 3986640.4054054054 bytes
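The example counts above imply the relative size of each split. A short sketch of that arithmetic follows; the variable names are illustrative, and the counts are copied from the list above.

```python
# Example counts per split, as listed above.
split_examples = {"train": 3907, "test": 196, "val": 781}

total_examples = sum(split_examples.values())

# Fraction of all examples that falls into each split.
split_fractions = {
    name: count / total_examples for name, count in split_examples.items()
}

print(f"total: {total_examples}")
for name, fraction in split_fractions.items():
    print(f"{name}: {fraction:.1%}")
```

This works out to 4884 examples in total, divided roughly 80% / 4% / 16% across train, test, and val.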

Dataset size

  • Download size: 7953657 bytes
  • Dataset size: 24930540.0 bytes
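The fractional per-split byte counts look unusual for a size measured in bytes. One possible explanation, offered here as an observation rather than a documented fact, is that the total dataset size was apportioned to the splits in proportion to their example counts. The sketch below checks that hypothesis and also derives the average size per example and the download-to-dataset size ratio. All numbers are copied from the listing above; the variable names are illustrative.

```python
import math

download_size = 7953657    # bytes, from the listing above
dataset_size = 24930540.0  # bytes, from the listing above

split_examples = {"train": 3907, "test": 196, "val": 781}
split_bytes = {
    "train": 19943411.093366094,
    "test": 1000488.5012285012,
    "val": 3986640.4054054054,
}

total_examples = sum(split_examples.values())

# Average in-memory size of one example, in bytes.
bytes_per_example = dataset_size / total_examples

# Ratio of the download size to the dataset size.
download_ratio = download_size / dataset_size

# Hypothesis: each split's byte count equals the average size per example
# multiplied by the number of examples in that split.
proportional = all(
    math.isclose(split_bytes[name], bytes_per_example * count, rel_tol=1e-9)
    for name, count in split_examples.items()
)

print(f"bytes per example: {bytes_per_example:.1f}")
print(f"download / dataset size: {download_ratio:.3f}")
print(f"split bytes proportional to example counts: {proportional}")
```

The check passes for all three splits, which is consistent with the splits having been carved out of a single larger table, although the listing itself does not say how they were produced.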