scriptmoney/OpenVoice-issues
收藏Hugging Face2024-06-07 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/scriptmoney/OpenVoice-issues
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: url
dtype: string
- name: repository_url
dtype: string
- name: labels_url
dtype: string
- name: comments_url
dtype: string
- name: events_url
dtype: string
- name: html_url
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: number
dtype: int64
- name: title
dtype: string
- name: user
struct:
- name: login
dtype: string
- name: id
dtype: int64
- name: node_id
dtype: string
- name: avatar_url
dtype: string
- name: gravatar_id
dtype: string
- name: url
dtype: string
- name: html_url
dtype: string
- name: followers_url
dtype: string
- name: following_url
dtype: string
- name: gists_url
dtype: string
- name: starred_url
dtype: string
- name: subscriptions_url
dtype: string
- name: organizations_url
dtype: string
- name: repos_url
dtype: string
- name: events_url
dtype: string
- name: received_events_url
dtype: string
- name: type
dtype: string
- name: site_admin
dtype: bool
- name: labels
sequence: 'null'
- name: state
dtype: string
- name: locked
dtype: bool
- name: assignee
dtype: 'null'
- name: assignees
sequence: 'null'
- name: milestone
dtype: 'null'
- name: comments
sequence: string
- name: created_at
dtype: timestamp[s]
- name: updated_at
dtype: timestamp[s]
- name: closed_at
dtype: timestamp[s]
- name: author_association
dtype: string
- name: active_lock_reason
dtype: 'null'
- name: body
dtype: string
- name: reactions
struct:
- name: url
dtype: string
- name: total_count
dtype: int64
- name: '+1'
dtype: int64
- name: '-1'
dtype: int64
- name: laugh
dtype: int64
- name: hooray
dtype: int64
- name: confused
dtype: int64
- name: heart
dtype: int64
- name: rocket
dtype: int64
- name: eyes
dtype: int64
- name: timeline_url
dtype: string
- name: performed_via_github_app
dtype: 'null'
- name: state_reason
dtype: string
- name: draft
dtype: bool
- name: pull_request
struct:
- name: url
dtype: string
- name: html_url
dtype: string
- name: diff_url
dtype: string
- name: patch_url
dtype: string
- name: merged_at
dtype: timestamp[s]
- name: is_pull_request
dtype: bool
splits:
- name: train
num_bytes: 587645
num_examples: 200
download_size: 138768
dataset_size: 587645
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset provides detailed information about issues and pull requests on GitHub, including URLs, IDs, titles, user information, state, locked status, comments, creation and update timestamps, author association, reactions (like thumbs up and hearts), timeline URLs, etc. The dataset is divided into a training set with 200 examples, totaling 587645 bytes, with download and actual sizes being 138768 and 587645 bytes respectively.
提供机构:
scriptmoney
原始信息汇总
数据集特征概述
基本特征
- url: 字符串类型
- repository_url: 字符串类型
- labels_url: 字符串类型
- comments_url: 字符串类型
- events_url: 字符串类型
- html_url: 字符串类型
- id: 整数类型
- node_id: 字符串类型
- number: 整数类型
- title: 字符串类型
用户特征
- user: 结构体类型,包含以下子特征:
- login: 字符串类型
- id: 整数类型
- node_id: 字符串类型
- avatar_url: 字符串类型
- gravatar_id: 字符串类型
- url: 字符串类型
- html_url: 字符串类型
- followers_url: 字符串类型
- following_url: 字符串类型
- gists_url: 字符串类型
- starred_url: 字符串类型
- subscriptions_url: 字符串类型
- organizations_url: 字符串类型
- repos_url: 字符串类型
- events_url: 字符串类型
- received_events_url: 字符串类型
- type: 字符串类型
- site_admin: 布尔类型
状态与标识特征
- labels: 序列类型,值为null
- state: 字符串类型
- locked: 布尔类型
- assignee: 值为null
- assignees: 序列类型,值为null
- milestone: 值为null
- comments: 序列类型,字符串类型
时间相关特征
- created_at: 时间戳类型
- updated_at: 时间戳类型
- closed_at: 时间戳类型
其他特征
- author_association: 字符串类型
- active_lock_reason: 值为null
- body: 字符串类型
- reactions: 结构体类型,包含以下子特征:
- url: 字符串类型
- total_count: 整数类型
- +1: 整数类型
- -1: 整数类型
- laugh: 整数类型
- hooray: 整数类型
- confused: 整数类型
- heart: 整数类型
- rocket: 整数类型
- eyes: 整数类型
- timeline_url: 字符串类型
- performed_via_github_app: 值为null
- state_reason: 字符串类型
- draft: 布尔类型
拉取请求特征
- pull_request: 结构体类型,包含以下子特征:
- url: 字符串类型
- html_url: 字符串类型
- diff_url: 字符串类型
- patch_url: 字符串类型
- merged_at: 时间戳类型
数据集分割
- train: 数据大小为587645字节,包含200个示例
数据集大小
- download_size: 138768字节
- dataset_size: 587645字节
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*



