
agency888/TaoGPT-v1

Source: Hugging Face. Updated: 2023-11-03. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/agency888/TaoGPT-v1
Description:

---
license: mit
task_categories:
- question-answering
- text2text-generation
- table-question-answering
language:
- en
tags:
- Science
- TaoScience
size_categories:
- 1K<n<10K
dataset_info:
  features:
  - name: answer
    dtype: string
  - name: text_mistral
    dtype: string
  - name: text
    dtype: string
  - name: text_finetuning
    dtype: string
  - name: question
    dtype: string
  splits:
  - name: train
    num_bytes: 1412556
    num_examples: 1552
  download_size: 476887
  dataset_size: 1412556
---

# TaoGPT Dataset

This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1).

## Dataset Details

### Dataset Description

- **Curated by:** [Adithya S K](https://github.com/adithya-s-k)
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Language(s) (NLP):** English
- **License:** MIT

### Dataset Sources [optional]

- **Repository:** [https://github.com/agencyxr/taogpt7B](https://github.com/agencyxr/taogpt7B)
- **Demo [optional]:** [More Information Needed]

## Uses

This dataset is used to fine-tune LLMs to answer questions about TaoScience.

### Direct Use

[More Information Needed]

## Dataset Structure

A list of question-and-answer pairs.

[More Information Needed]
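Provider:
agency888
Summary of original information

TaoGPT dataset

Dataset details

Dataset description

  • Language (NLP): English
  • License: MIT

Dataset structure

Features

  • answer: string
  • text_mistral: string
  • text: string
  • text_finetuning: string
  • question: string

Splits

  • train:
    • Size: 1412556 bytes
    • Number of examples: 1552

Size

  • Download size: 476887 bytes
  • Dataset size: 1412556 bytes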

Uses

This dataset is used to fine-tune large language models to answer questions about TaoScience.
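
A minimal sketch of how the train split might be loaded and turned into fine-tuning prompts, assuming the Hugging Face `datasets` library is installed and the dataset is still published as `agency888/TaoGPT-v1`. The `TaoGPTRecord` type, the helper names, and the prompt template in `to_prompt` are hypothetical. The card does not document how the `text`, `text_mistral`, and `text_finetuning` fields are formatted, so the sketch uses only `question` and `answer`.

```python
from typing import TypedDict


class TaoGPTRecord(TypedDict):
    """One row, using the five string features listed on the card."""
    answer: str
    text_mistral: str
    text: str
    text_finetuning: str
    question: str


def load_train_split():
    """Download the train split from the Hugging Face Hub (needs network access)."""
    from datasets import load_dataset  # third-party: pip install datasets
    return load_dataset("agency888/TaoGPT-v1", split="train")


def to_prompt(record: TaoGPTRecord) -> str:
    """Hypothetical prompt template; NOT the format stored in the dataset."""
    return (
        f"### Question:\n{record['question']}\n\n"
        f"### Answer:\n{record['answer']}"
    )


# Placeholder record for illustration only; these are not real rows.
example: TaoGPTRecord = {
    "answer": "<answer text>",
    "text_mistral": "",
    "text": "",
    "text_finetuning": "",
    "question": "<question text>",
}
print(to_prompt(example))
```

With network access, `train = load_train_split()` followed by `to_prompt(train[0])` would apply the same template to a real row. The pre-formatted `text_*` columns may already suit a particular model, so inspect them before writing a custom template.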
