five

Original Prompts

收藏
arXiv2025-09-30 收录
下载链接:
https://arxiv.org/abs/2412.08127
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了从WikiText-103语料库中抽取的5,000个序列,这些序列包含了为语言模型设计的原始提示。经过基于BLEU分数的过滤过程后,数据集最终由2,473组原始提示、自动提示及其后续内容组成。该数据集的规模为5,000个序列,其任务是训练语言模型的自动提示功能。

This dataset includes 5,000 sequences extracted from the WikiText-103 corpus, with each sequence containing original prompts tailored for language models. Following a filtering procedure based on BLEU scores, the finalized dataset comprises 2,473 sets of original prompts, auto-generated prompts and their associated subsequent content. With a total of 5,000 sequences, this dataset is developed for training the automatic prompt generation capability of language models.
提供机构:
Publicly available via supplementary materials upon publication
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作