linqus/github-issues

Name: linqus/github-issues
Creator: linqus
Published: 2023-12-10 17:57:29
License: 暂无描述

Hugging Face2023-12-10 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/linqus/github-issues

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: url dtype: string - name: repository_url dtype: string - name: labels_url dtype: string - name: comments_url dtype: string - name: events_url dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: node_id dtype: string - name: number dtype: int64 - name: title dtype: string - name: user struct: - name: avatar_url dtype: string - name: events_url dtype: string - name: followers_url dtype: string - name: following_url dtype: string - name: gists_url dtype: string - name: gravatar_id dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: login dtype: string - name: node_id dtype: string - name: organizations_url dtype: string - name: received_events_url dtype: string - name: repos_url dtype: string - name: site_admin dtype: bool - name: starred_url dtype: string - name: subscriptions_url dtype: string - name: type dtype: string - name: url dtype: string - name: labels list: - name: color dtype: string - name: default dtype: bool - name: description dtype: string - name: id dtype: int64 - name: name dtype: string - name: node_id dtype: string - name: url dtype: string - name: state dtype: string - name: locked dtype: bool - name: assignee struct: - name: avatar_url dtype: string - name: events_url dtype: string - name: followers_url dtype: string - name: following_url dtype: string - name: gists_url dtype: string - name: gravatar_id dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: login dtype: string - name: node_id dtype: string - name: organizations_url dtype: string - name: received_events_url dtype: string - name: repos_url dtype: string - name: site_admin dtype: bool - name: starred_url dtype: string - name: subscriptions_url dtype: string - name: type dtype: string - name: url dtype: string - name: assignees list: - name: avatar_url dtype: string - name: events_url dtype: string - name: followers_url dtype: string - name: following_url dtype: string - name: gists_url dtype: string - name: gravatar_id dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: login dtype: string - name: node_id dtype: string - name: organizations_url dtype: string - name: received_events_url dtype: string - name: repos_url dtype: string - name: site_admin dtype: bool - name: starred_url dtype: string - name: subscriptions_url dtype: string - name: type dtype: string - name: url dtype: string - name: milestone struct: - name: closed_at dtype: string - name: closed_issues dtype: int64 - name: created_at dtype: string - name: creator struct: - name: avatar_url dtype: string - name: events_url dtype: string - name: followers_url dtype: string - name: following_url dtype: string - name: gists_url dtype: string - name: gravatar_id dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: login dtype: string - name: node_id dtype: string - name: organizations_url dtype: string - name: received_events_url dtype: string - name: repos_url dtype: string - name: site_admin dtype: bool - name: starred_url dtype: string - name: subscriptions_url dtype: string - name: type dtype: string - name: url dtype: string - name: description dtype: string - name: due_on dtype: string - name: html_url dtype: string - name: id dtype: int64 - name: labels_url dtype: string - name: node_id dtype: string - name: number dtype: int64 - name: open_issues dtype: int64 - name: state dtype: string - name: title dtype: string - name: updated_at dtype: string - name: url dtype: string - name: comments sequence: string - name: created_at dtype: timestamp[ns, tz=UTC] - name: updated_at dtype: timestamp[ns, tz=UTC] - name: closed_at dtype: timestamp[ns, tz=UTC] - name: author_association dtype: string - name: active_lock_reason dtype: float64 - name: body dtype: string - name: reactions struct: - name: '+1' dtype: int64 - name: '-1' dtype: int64 - name: confused dtype: int64 - name: eyes dtype: int64 - name: heart dtype: int64 - name: hooray dtype: int64 - name: laugh dtype: int64 - name: rocket dtype: int64 - name: total_count dtype: int64 - name: url dtype: string - name: timeline_url dtype: string - name: performed_via_github_app dtype: float64 - name: state_reason dtype: string - name: draft dtype: float64 - name: pull_request struct: - name: diff_url dtype: string - name: html_url dtype: string - name: merged_at dtype: string - name: patch_url dtype: string - name: url dtype: string - name: is_pull_request dtype: bool splits: - name: train num_bytes: 1717058 num_examples: 100 download_size: 564909 dataset_size: 1717058 configs: - config_name: default data_files: - split: train path: data/train-* --- # Dataset Card for Dataset Name  This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1). ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

提供机构：

linqus

原始信息汇总

数据集详情

数据集描述

数据集包含以下特征：

url: 字符串类型
repository_url: 字符串类型
labels_url: 字符串类型
comments_url: 字符串类型
events_url: 字符串类型
html_url: 字符串类型
id: 64位整数类型
node_id: 字符串类型
number: 64位整数类型
title: 字符串类型
user: 结构体类型，包含以下字段：
- avatar_url: 字符串类型
- events_url: 字符串类型
- followers_url: 字符串类型
- following_url: 字符串类型
- gists_url: 字符串类型
- gravatar_id: 字符串类型
- html_url: 字符串类型
- id: 64位整数类型
- login: 字符串类型
- node_id: 字符串类型
- organizations_url: 字符串类型
- received_events_url: 字符串类型
- repos_url: 字符串类型
- site_admin: 布尔类型
- starred_url: 字符串类型
- subscriptions_url: 字符串类型
- type: 字符串类型
- url: 字符串类型
labels: 列表类型，包含以下字段：
- color: 字符串类型
- default: 布尔类型
- description: 字符串类型
- id: 64位整数类型
- name: 字符串类型
- node_id: 字符串类型
- url: 字符串类型
state: 字符串类型
locked: 布尔类型
assignee: 结构体类型，包含以下字段：
- avatar_url: 字符串类型
- events_url: 字符串类型
- followers_url: 字符串类型
- following_url: 字符串类型
- gists_url: 字符串类型
- gravatar_id: 字符串类型
- html_url: 字符串类型
- id: 64位整数类型
- login: 字符串类型
- node_id: 字符串类型
- organizations_url: 字符串类型
- received_events_url: 字符串类型
- repos_url: 字符串类型
- site_admin: 布尔类型
- starred_url: 字符串类型
- subscriptions_url: 字符串类型
- type: 字符串类型
- url: 字符串类型
assignees: 列表类型，包含以下字段：
- avatar_url: 字符串类型
- events_url: 字符串类型
- followers_url: 字符串类型
- following_url: 字符串类型
- gists_url: 字符串类型
- gravatar_id: 字符串类型
- html_url: 字符串类型
- id: 64位整数类型
- login: 字符串类型
- node_id: 字符串类型
- organizations_url: 字符串类型
- received_events_url: 字符串类型
- repos_url: 字符串类型
- site_admin: 布尔类型
- starred_url: 字符串类型
- subscriptions_url: 字符串类型
- type: 字符串类型
- url: 字符串类型
milestone: 结构体类型，包含以下字段：
- closed_at: 字符串类型
- closed_issues: 64位整数类型
- created_at: 字符串类型
- creator: 结构体类型，包含以下字段：
  - avatar_url: 字符串类型
  - events_url: 字符串类型
  - followers_url: 字符串类型
  - following_url: 字符串类型
  - gists_url: 字符串类型
  - gravatar_id: 字符串类型
  - html_url: 字符串类型
  - id: 64位整数类型
  - login: 字符串类型
  - node_id: 字符串类型
  - organizations_url: 字符串类型
  - received_events_url: 字符串类型
  - repos_url: 字符串类型
  - site_admin: 布尔类型
  - starred_url: 字符串类型
  - subscriptions_url: 字符串类型
  - type: 字符串类型
  - url: 字符串类型
- description: 字符串类型
- due_on: 字符串类型
- html_url: 字符串类型
- id: 64位整数类型
- labels_url: 字符串类型
- node_id: 字符串类型
- number: 64位整数类型
- open_issues: 64位整数类型
- state: 字符串类型
- title: 字符串类型
- updated_at: 字符串类型
- url: 字符串类型
comments: 字符串序列类型
created_at: 时间戳类型，UTC时区
updated_at: 时间戳类型，UTC时区
closed_at: 时间戳类型，UTC时区
author_association: 字符串类型
active_lock_reason: 64位浮点数类型
body: 字符串类型
reactions: 结构体类型，包含以下字段：
- +1: 64位整数类型
- -1: 64位整数类型
- confused: 64位整数类型
- eyes: 64位整数类型
- heart: 64位整数类型
- hooray: 64位整数类型
- laugh: 64位整数类型
- rocket: 64位整数类型
- total_count: 64位整数类型
- url: 字符串类型
timeline_url: 字符串类型
performed_via_github_app: 64位浮点数类型
state_reason: 字符串类型
draft: 64位浮点数类型
pull_request: 结构体类型，包含以下字段：
- diff_url: 字符串类型
- html_url: 字符串类型
- merged_at: 字符串类型
- patch_url: 字符串类型
- url: 字符串类型
is_pull_request: 布尔类型

数据集分割

train: 包含100个样本，总字节数为1717058

数据集大小

下载大小: 564909字节
数据集大小: 1717058字节

配置

default: 包含训练数据文件，路径为data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集