vdaita/editpackft_inst_line
收藏Hugging Face2024-06-27 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/vdaita/editpackft_inst_line
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为editpackft_inst_line,包含训练集和测试集两个分割。训练集有4500个样本,测试集有500个样本。数据集的特征包括commit、old_file、new_file、old_contents、new_contents、subject、message、lang、license、repos、ndiff、instruction、content、patch、inst、INSTRUCTION和RESPONSE等字段。这些字段的数据类型均为字符串。数据集的下载大小为16340528字节,数据集总大小为36942059字节。
The dataset named editpackft_inst_line includes two splits: train and test. The train split contains 4500 examples, and the test split contains 500 examples. The features of the dataset include commit, old_file, new_file, old_contents, new_contents, subject, message, lang, license, repos, ndiff, instruction, content, patch, inst, INSTRUCTION, and RESPONSE, all of which are of string type. The download size of the dataset is 16340528 bytes, and the total dataset size is 36942059 bytes.
提供机构:
vdaita
原始信息汇总
数据集概述
数据集名称
editpackft_inst_line
数据集配置
- 默认配置
- 训练集路径:
data/train-* - 测试集路径:
data/test-*
- 训练集路径:
数据集特征
commit: 字符串类型old_file: 字符串类型new_file: 字符串类型old_contents: 字符串类型new_contents: 字符串类型subject: 字符串类型message: 字符串类型lang: 字符串类型license: 字符串类型repos: 字符串类型ndiff: 字符串类型instruction: 字符串类型content: 字符串类型patch: 字符串类型inst: 字符串类型INSTRUCTION: 字符串类型RESPONSE: 字符串类型
数据集分割
- 训练集
- 字节数: 33150906
- 样本数: 4500
- 测试集
- 字节数: 3791153
- 样本数: 500
数据集大小
- 下载大小: 16340528 字节
- 数据集总大小: 36942059 字节



