yatsy/GPT-wiki-intro-extension
收藏Hugging Face2023-10-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/yatsy/GPT-wiki-intro-extension
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators:
- no-annotation
language_creators:
- machine-generated
language:
- en
license:
- cc
multilinguality:
- monolingual
size_categories:
- 1K<n<10K
source_datasets:
- aadityaubhat/GPT-wiki-intro
task_categories:
- text-classification
task_ids: []
pretty_name: GPT-wiki-intro-extension
tags:
- facebook/opt-1.3b
- facebook/opt-2.7b
- facebook/opt-125m
- meta-llama/Llama-2-7b-chat-hf
- meta-llama/Llama-2-13b-chat-hf
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
dataset_info:
features:
- name: id
dtype: int64
- name: generated
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 6375445
num_examples: 5000
download_size: 2174490
dataset_size: 6375445
---
# GPT Wiki Intro Extension
This dataset is extension of aadityaubhat/GPT-wiki-intro.
1000 promts processed through several LLM witout sampling. (opt-125m, opt-1.3b, opt-2.7b, llama2-7b-chat, and llama2-13b-chat)
Schema for the dataset
|Column |Datatype|Description |
|---------------------|--------|-------------------------------------------|
|id |int64 |ID from original dataset |
|generated |string |Model's output |
|source |string |opt-125m, opt-1.3b, opt-2.7b, llama2-7b or llama2-13b |
```
@misc {yatsy,
author = { {Kirill Safronov} },
title = { GPT-wiki-intro-extension },
year = 2023,
url = { https://huggingface.co/datasets/yatsy/GPT-wiki-intro-extension },
publisher = { Hugging Face }
}
```
提供机构:
yatsy
原始信息汇总
GPT Wiki Intro Extension 数据集概述
基本信息
- 数据集名称: GPT-wiki-intro-extension
- 语言: 英语
- 许可证: CC
- 多语言性: 单语种
- 数据集大小: 1K<n<10K
- 源数据集: aadityaubhat/GPT-wiki-intro
- 任务类别: 文本分类
数据集配置
- 配置名称: default
- 数据文件:
- 分割: train
- 路径: data/train-*
数据集信息
- 特征:
- id: int64
- generated: string
- source: string
- 分割:
- train:
- 字节数: 6375445
- 样本数: 5000
- train:
- 下载大小: 2174490
- 数据集大小: 6375445
数据集描述
该数据集是 aadityaubhat/GPT-wiki-intro 的扩展,通过多个大型语言模型(opt-125m, opt-1.3b, opt-2.7b, llama2-7b-chat, llama2-13b-chat)处理了 1000 个提示,未进行采样。
数据集结构
| 列名 | 数据类型 | 描述 |
|---|---|---|
| id | int64 | 原始数据集的ID |
| generated | string | 模型的输出 |
| source | string | opt-125m, opt-1.3b, opt-2.7b, llama2-7b 或 llama2-13b |



