fxmeng/CodeFeedback-Python105K
收藏Hugging Face2024-11-14 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/fxmeng/CodeFeedback-Python105K
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从`m-a-p/CodeFeedback-Filtered-Instruction`数据集中提取的一个子集,包含了104,848个用Python编写的样本。这些样本来源于四个开源代码指令调优数据集:Magicoder-OSS-Instruct、Python code subset of ShareGPT、Magicoder-Evol-Instruct和Evol-Instruct-Code。数据集的特征包括查询(query)和响应(response),主要用于问答任务,语言为英语,规模在10K到100K之间。
This dataset is a subset derived from the `m-a-p/CodeFeedback-Filtered-Instruction` dataset, which contains 156,526 samples. Specifically, this subset selects only the 104,848 samples written in Python. The dataset is primarily used for code instruction query and response tasks, featuring two main features: query and response. The dataset is divided into a training set (train) containing 104,848 samples. The language of the dataset is English, and its size is between 10K and 100K.
提供机构:
fxmeng



