rafy2342/Reddit-SFW-Writing_Prompts_ShareGPT_Curated
收藏Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/rafy2342/Reddit-SFW-Writing_Prompts_ShareGPT_Curated
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: mit
size_categories:
- 100K<n<1M
task_categories:
- text-generation
- question-answering
dataset_info:
features:
- name: prompt
dtype: string
- name: title
dtype: string
- name: body
dtype: string
splits:
- name: train
num_examples: 177477
---
# Normalized SFW Reddit Writing Prompts
## Dataset Description
This dataset is a normalized, flattened version of curated Reddit writing prompts, specifically derived from `ChaoticNeutrals/Reddit-SFW-Writing_Prompts_ShareGPT`. It maps nested conversational arrays into a strict instruction-response schema, making it highly optimized for instruction-tuning Large Language Models.
### Dataset Schema
| Column Name | Type | Description |
| :--- | :--- | :--- |
| `prompt` | `string` | The input prompt, user request, or human instruction. |
| `title` | `string` | The title of the post (Empty in this subset but kept for schema consistency). |
| `body` | `string` | The generated story, assistant response, or target text. |
## Example Data
**Prompt:**
> After being dead for several months, you awake, with no memories of where you are, or why you are dead. Terrified, you break out of your casket and out of the ground, only to turn around and read the name on the gravestone.
**Body:**
> *I can't breathe.*
I take another rapid breath, trying to obtain enough oxygen to quell my throbbing lungs. Every breath, however, feels like it's tearing a hole through my body, taking my calm with it. I open my eyes to be met with darkness. It's taunting.
"Hello?"
*breath*
"Help!"
*breath*
"...
## Curation Process
This dataset was processed via an automated Python pipeline using the `datasets` library. The pipeline unpacks list-of-dictionaries formatted as `{'from': 'human', 'value': '...'}` and standardizes them into flat rows. Completely empty prompts/bodies were filtered out to ensure high data quality.
提供机构:
rafy2342



