five

ajibawa-2023/Code-74k-ShareGPT

收藏
Hugging Face2023-12-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ajibawa-2023/Code-74k-ShareGPT
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-nd-4.0 task_categories: - conversational language: - en tags: - code size_categories: - 10K<n<100K --- **Code-74k-ShareGPT** This dataset is in Vicuna/ShareGPT format. There are around 74000 set of conversations. Each set having 2 conversations. Along with Python, Java, JavaScript, GO, C++, Rust etc. code with detailed explanation are provided. It is built upon using my existing Dataset [Python-Code-23k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Python-Code-23k-ShareGPT). Additional dataset was generated using GPT-3.5, GPT-4 etc.

The Code-74k-ShareGPT dataset contains around 74000 sets of conversations, each set having two parts. These conversations involve multiple programming languages such as Python, Java, JavaScript, GO, C++, and Rust, along with detailed explanations. The dataset is built upon an existing dataset, Python-Code-23k-ShareGPT, and additional data was generated using models like GPT-3.5 and GPT-4.
提供机构:
ajibawa-2023
原始信息汇总

数据集概述

基本信息

  • 许可证: cc-by-nc-nd-4.0
  • 任务类别: conversational
  • 语言: en
  • 标签: code
  • 规模类别: 10K<n<100K

详细描述

  • 名称: Code-74k-ShareGPT
  • 格式: Vicuna/ShareGPT
  • 内容: 包含约74000组对话,每组包含2段对话。
  • 代码类型: 包括Python, Java, JavaScript, GO, C++, Rust等,并附有详细解释。
  • 来源: 基于现有数据集Python-Code-23k-ShareGPT构建,并使用GPT-3.5, GPT-4等生成额外数据。
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作