roborovski/phi-2-labeled
收藏Hugging Face2023-07-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/roborovski/phi-2-labeled
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: hexsha
dtype: string
- name: size
dtype: int64
- name: ext
dtype: string
- name: lang
dtype: string
- name: max_stars_repo_path
dtype: string
- name: max_stars_repo_name
dtype: string
- name: max_stars_repo_head_hexsha
dtype: string
- name: max_stars_repo_licenses
sequence: string
- name: max_stars_count
dtype: int64
- name: max_stars_repo_stars_event_min_datetime
dtype: string
- name: max_stars_repo_stars_event_max_datetime
dtype: string
- name: max_issues_repo_path
dtype: string
- name: max_issues_repo_name
dtype: string
- name: max_issues_repo_head_hexsha
dtype: string
- name: max_issues_repo_licenses
sequence: string
- name: max_issues_count
dtype: int64
- name: max_issues_repo_issues_event_min_datetime
dtype: string
- name: max_issues_repo_issues_event_max_datetime
dtype: string
- name: max_forks_repo_path
dtype: string
- name: max_forks_repo_name
dtype: string
- name: max_forks_repo_head_hexsha
dtype: string
- name: max_forks_repo_licenses
sequence: string
- name: max_forks_count
dtype: int64
- name: max_forks_repo_forks_event_min_datetime
dtype: string
- name: max_forks_repo_forks_event_max_datetime
dtype: string
- name: content
dtype: string
- name: avg_line_length
dtype: float64
- name: max_line_length
dtype: int64
- name: alphanum_fraction
dtype: float64
- name: label
dtype: int64
- name: cost
dtype: float64
splits:
- name: train
num_bytes: 283814677
num_examples: 50000
download_size: 112938830
dataset_size: 283814677
---
# Dataset Card for "phi-1"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
roborovski
原始信息汇总
数据集特征概述
数据集特征列表
- hexsha (字符串)
- size (整数)
- ext (字符串)
- lang (字符串)
- max_stars_repo_path (字符串)
- max_stars_repo_name (字符串)
- max_stars_repo_head_hexsha (字符串)
- max_stars_repo_licenses (字符串序列)
- max_stars_count (整数)
- max_stars_repo_stars_event_min_datetime (字符串)
- max_stars_repo_stars_event_max_datetime (字符串)
- max_issues_repo_path (字符串)
- max_issues_repo_name (字符串)
- max_issues_repo_head_hexsha (字符串)
- max_issues_repo_licenses (字符串序列)
- max_issues_count (整数)
- max_issues_repo_issues_event_min_datetime (字符串)
- max_issues_repo_issues_event_max_datetime (字符串)
- max_forks_repo_path (字符串)
- max_forks_repo_name (字符串)
- max_forks_repo_head_hexsha (字符串)
- max_forks_repo_licenses (字符串序列)
- max_forks_count (整数)
- max_forks_repo_forks_event_min_datetime (字符串)
- max_forks_repo_forks_event_max_datetime (字符串)
- content (字符串)
- avg_line_length (浮点数)
- max_line_length (整数)
- alphanum_fraction (浮点数)
- label (整数)
- cost (浮点数)
数据集分割信息
- train
- 字节数: 283814677
- 示例数: 50000
数据集大小
- 下载大小: 112938830
- 数据集大小: 283814677



