vikp/python_code_instructions_filtered
收藏Hugging Face2023-08-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/vikp/python_code_instructions_filtered
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: output
dtype: string
- name: instruction
dtype: string
- name: kind
dtype: string
splits:
- name: train
num_bytes: 313731517
num_examples: 170635
download_size: 160726948
dataset_size: 313731517
---
# Dataset Card for "code_filtered"
This includes data from [xlcost](https://huggingface.co/datasets/vikp/xlcost_filtered_2k), [evol instruct](https://huggingface.co/datasets/vikp/evol_instruct_code_filtered_39k), [code alpaca](https://huggingface.co/datasets/vikp/evol_codealpaca_filtered_87k), [code instructions](https://huggingface.co/datasets/vikp/code_instructions_filtered_7k), and [code search net](https://huggingface.co/datasets/vikp/code_search_net_filtered_34k). Data is filtered based on quality and learning value.
The dataset code_filtered includes data from multiple sources such as xlcost, evol instruct, code alpaca, code instructions, and code search net. The data has been filtered for quality and learning value. The dataset features include output, instruction, and kind, all of which are string types. The dataset is split into a training set with 170635 examples, totaling 313731517 bytes.
提供机构:
vikp
原始信息汇总
数据集概述
数据集信息
- 特征:
output: 数据类型为字符串instruction: 数据类型为字符串kind: 数据类型为字符串
数据分割
- 训练集:
- 字节数: 313731517
- 样本数: 170635
数据集大小
- 下载大小: 160726948 字节
- 数据集大小: 313731517 字节



