eitanturok/commitpackft|代码分析数据集|自然语言处理数据集
收藏hugging_face2024-03-06 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/eitanturok/commitpackft
下载链接
链接失效反馈资源简介:
---
dataset_info:
config_name: python
features:
- name: commit
dtype: string
- name: old_file
dtype: string
- name: new_file
dtype: string
- name: old_contents
dtype: string
- name: new_contents
dtype: string
- name: subject
dtype: string
- name: message
dtype: string
- name: lang
dtype: string
- name: license
dtype: string
- name: repos
dtype: string
- name: prompt
dtype: string
- name: response
dtype: string
- name: prompt_tagged
dtype: string
- name: response_tagged
dtype: string
- name: text
dtype: string
- name: text_tagged
dtype: string
splits:
- name: train
num_bytes: 509786862
num_examples: 56025
download_size: 222635526
dataset_size: 509786862
configs:
- config_name: python
data_files:
- split: train
path: python/train-*
---
# Dataset Card for "commitpackft"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
eitanturok
原始信息汇总
数据集概述
数据集信息
- 配置名称: python
- 特征列表:
commit: 字符串old_file: 字符串new_file: 字符串old_contents: 字符串new_contents: 字符串subject: 字符串message: 字符串lang: 字符串license: 字符串repos: 字符串prompt: 字符串response: 字符串prompt_tagged: 字符串response_tagged: 字符串text: 字符串text_tagged: 字符串
数据分割
- 训练集:
- 名称: train
- 字节数: 509786862
- 样本数: 56025
数据集大小
- 下载大小: 222635526 字节
- 数据集大小: 509786862 字节
配置详情
- 配置名称: python
- 数据文件:
- 分割: train
- 路径: python/train-*



