anubhavmaity/github-issues
收藏Hugging Face2023-10-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/anubhavmaity/github-issues
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: url
dtype: string
- name: repository_url
dtype: string
- name: labels_url
dtype: string
- name: comments_url
dtype: string
- name: events_url
dtype: string
- name: html_url
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: number
dtype: int64
- name: title
dtype: string
- name: user
dtype: string
- name: labels
dtype: string
- name: state
dtype: string
- name: locked
dtype: bool
- name: assignee
dtype: string
- name: assignees
dtype: string
- name: milestone
dtype: string
- name: comments
dtype: string
- name: created_at
dtype: string
- name: updated_at
dtype: string
- name: closed_at
dtype: string
- name: author_association
dtype: string
- name: active_lock_reason
dtype: float64
- name: body
dtype: string
- name: reactions
dtype: string
- name: timeline_url
dtype: string
- name: performed_via_github_app
dtype: float64
- name: state_reason
dtype: string
- name: draft
dtype: float64
- name: pull_request
dtype: string
- name: is_pull_request
dtype: bool
splits:
- name: train
num_bytes: 35370223
num_examples: 6279
download_size: 9128830
dataset_size: 35370223
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
annotations_creators:
- other
language:
- en
language_creators:
- other
license: []
multilinguality:
- monolingual
pretty_name: Github Issues
size_categories:
- 1K<n<10K
source_datasets:
- original
tags:
- github-issues
- huggingface-nlp-course
- datasets
task_categories:
- text-classification
- text-retrieval
task_ids:
- multi-class-classification
- multi-label-classification
- document-retrieval
提供机构:
anubhavmaity
原始信息汇总
数据集概述
数据集信息
-
特征列表:
url: 字符串repository_url: 字符串labels_url: 字符串comments_url: 字符串events_url: 字符串html_url: 字符串id: 整数node_id: 字符串number: 整数title: 字符串user: 字符串labels: 字符串state: 字符串locked: 布尔值assignee: 字符串assignees: 字符串milestone: 字符串comments: 字符串created_at: 字符串updated_at: 字符串closed_at: 字符串author_association: 字符串active_lock_reason: 浮点数body: 字符串reactions: 字符串timeline_url: 字符串performed_via_github_app: 浮点数state_reason: 字符串draft: 浮点数pull_request: 字符串is_pull_request: 布尔值
-
数据分割:
train: 35,370,223 字节, 6,279 样本
-
数据集大小:
- 下载大小: 9,128,830 字节
- 数据集大小: 35,370,223 字节
-
配置:
default- 数据文件:
train:data/train-*
- 数据文件:
数据集属性
-
注释创建者:
- 其他
-
语言:
- 英语
-
语言创建者:
- 其他
-
许可证:
- 无
-
多语言性:
- 单语种
-
数据集名称:
- Github Issues
-
大小类别:
- 1K<n<10K
-
源数据集:
- 原始数据
-
标签:
- github-issues
- huggingface-nlp-course
- datasets
-
任务类别:
- 文本分类
- 文本检索
-
任务ID:
- 多类分类
- 多标签分类
- 文档检索



