agency888/TaoGPT-v1
收藏Hugging Face2023-11-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/agency888/TaoGPT-v1
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- question-answering
- text2text-generation
- table-question-answering
language:
- en
tags:
- Science
- TaoScience
size_categories:
- 1K<n<10K
dataset_info:
features:
- name: answer
dtype: string
- name: text_mistral
dtype: string
- name: text
dtype: string
- name: text_finetuning
dtype: string
- name: question
dtype: string
splits:
- name: train
num_bytes: 1412556
num_examples: 1552
download_size: 476887
dataset_size: 1412556
---
# ToaGPT Dataset
<!-- Provide a quick summary of the dataset. -->
This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1).
## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
- **Curated by:** [Adithya S K](https://github.com/adithya-s-k)
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Language(s) (NLP):** [English]
- **License:** [MIT]
### Dataset Sources [optional]
<!-- Provide the basic links for the dataset. -->
- **Repository:** [https://github.com/agencyxr/taogpt7B](https://github.com/agencyxr/taogpt7B)
- **Demo [optional]:** [More Information Needed]
## Uses
<!-- Address questions around how the dataset is intended to be used. -->
This Dataset is Used to Finetune LLMs for Answering questions with respect to TaoScience
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
[More Information Needed]
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
List of Question and Answer Pairs
[More Information Needed]
提供机构:
agency888
原始信息汇总
ToaGPT 数据集
数据集详情
数据集描述
- 语言(NLP): 英语
- 许可证: MIT
数据集结构
特征
- answer: 字符串类型
- text_mistral: 字符串类型
- text: 字符串类型
- text_finetuning: 字符串类型
- question: 字符串类型
分割
- train:
- 字节数: 1412556
- 样本数: 1552
大小
- 下载大小: 476887
- 数据集大小: 1412556
用途
该数据集用于微调大型语言模型,以回答与TaoScience相关的问题。



