GitBag/prompt-collection-v0.1
收藏Hugging Face2024-05-28 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/GitBag/prompt-collection-v0.1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: dataset
dtype: string
- name: context
dtype: string
- name: context_messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: id
dtype: string
- name: llama_prompt
dtype: string
- name: llama_prompt_tokens
sequence: int64
splits:
- name: train
num_bytes: 1828018236.9845586
num_examples: 156545
- name: test
num_bytes: 5838635.0154414335
num_examples: 500
download_size: 362307491
dataset_size: 1833856872.0
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
---
The dataset includes multiple features such as dataset, context, context_messages, id, llama_prompt, llama_prompt_tokens, etc. Each feature has its specific data type. The dataset is divided into training and test sets, containing 156545 and 500 examples respectively. The size and download size of the dataset are also clearly recorded.
提供机构:
GitBag
原始信息汇总
数据集概述
数据集特征
- dataset: 数据类型 - 字符串
- context: 数据类型 - 字符串
- context_messages: 列表类型,包含以下子特征:
- content: 数据类型 - 字符串
- role: 数据类型 - 字符串
- id: 数据类型 - 字符串
- llama_prompt: 数据类型 - 字符串
- llama_prompt_tokens: 数据类型 - 序列(int64)
数据集划分
- 训练集 (train):
- 示例数量: 156545
- 数据大小: 1828018236.9845586 字节
- 测试集 (test):
- 示例数量: 500
- 数据大小: 5838635.0154414335 字节
数据集大小
- 下载大小: 362307491 字节
- 数据集总大小: 1833856872.0 字节
数据文件配置
- 默认配置 (default):
- 训练集路径:
data/train-* - 测试集路径:
data/test-*
- 训练集路径:



