DLI-Lab/COFFEE-Editor-sample
收藏Hugging Face2024-06-06 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/DLI-Lab/COFFEE-Editor-sample
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: problem_id
dtype: string
- name: description
dtype: string
- name: feedback
dtype: string
- name: wrong_code
dtype: string
- name: input_format
dtype: string
- name: user_id
dtype: string
- name: output_format
dtype: string
- name: index
dtype: int64
- name: correct_code
dtype: string
splits:
- name: train
num_bytes: 9736765
num_examples: 3600
- name: eval
num_bytes: 1094571
num_examples: 400
download_size: 3874981
dataset_size: 10831336
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: eval
path: data/eval-*
---
This dataset contains information related to programming problems, including problem ID, description, user feedback, wrong code, input and output formats, etc. It is divided into training and evaluation sets for model training and evaluation.
提供机构:
DLI-Lab
原始信息汇总
数据集概述
数据集特征
- problem_id: 数据类型为字符串
- description: 数据类型为字符串
- feedback: 数据类型为字符串
- wrong_code: 数据类型为字符串
- input_format: 数据类型为字符串
- user_id: 数据类型为字符串
- output_format: 数据类型为字符串
- index: 数据类型为整数
- correct_code: 数据类型为字符串
数据集划分
- 训练集 (train):
- 数据量: 3600个样本
- 存储大小: 9736765字节
- 评估集 (eval):
- 数据量: 400个样本
- 存储大小: 1094571字节
数据集大小
- 下载大小: 3874981字节
- 数据集总大小: 10831336字节
数据文件配置
- 默认配置 (default):
- 训练集路径:
data/train-* - 评估集路径:
data/eval-*
- 训练集路径:



