ajibawa-2023/Code-74k-ShareGPT
收藏Hugging Face2023-12-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ajibawa-2023/Code-74k-ShareGPT
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-nd-4.0
task_categories:
- conversational
language:
- en
tags:
- code
size_categories:
- 10K<n<100K
---
**Code-74k-ShareGPT**
This dataset is in Vicuna/ShareGPT format. There are around 74000 set of conversations. Each set having 2 conversations.
Along with Python, Java, JavaScript, GO, C++, Rust etc. code with detailed explanation are provided. It is built upon using my existing Dataset [Python-Code-23k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Python-Code-23k-ShareGPT).
Additional dataset was generated using GPT-3.5, GPT-4 etc.
The Code-74k-ShareGPT dataset contains around 74000 sets of conversations, each set having two parts. These conversations involve multiple programming languages such as Python, Java, JavaScript, GO, C++, and Rust, along with detailed explanations. The dataset is built upon an existing dataset, Python-Code-23k-ShareGPT, and additional data was generated using models like GPT-3.5 and GPT-4.
提供机构:
ajibawa-2023
原始信息汇总
数据集概述
基本信息
- 许可证: cc-by-nc-nd-4.0
- 任务类别: conversational
- 语言: en
- 标签: code
- 规模类别: 10K<n<100K
详细描述
- 名称: Code-74k-ShareGPT
- 格式: Vicuna/ShareGPT
- 内容: 包含约74000组对话,每组包含2段对话。
- 代码类型: 包括Python, Java, JavaScript, GO, C++, Rust等,并附有详细解释。
- 来源: 基于现有数据集Python-Code-23k-ShareGPT构建,并使用GPT-3.5, GPT-4等生成额外数据。



