five

rafy2342/Reddit-SFW-Writing_Prompts_ShareGPT_Curated

收藏
Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/rafy2342/Reddit-SFW-Writing_Prompts_ShareGPT_Curated
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: mit size_categories: - 100K<n<1M task_categories: - text-generation - question-answering dataset_info: features: - name: prompt dtype: string - name: title dtype: string - name: body dtype: string splits: - name: train num_examples: 177477 --- # Normalized SFW Reddit Writing Prompts ## Dataset Description This dataset is a normalized, flattened version of curated Reddit writing prompts, specifically derived from `ChaoticNeutrals/Reddit-SFW-Writing_Prompts_ShareGPT`. It maps nested conversational arrays into a strict instruction-response schema, making it highly optimized for instruction-tuning Large Language Models. ### Dataset Schema | Column Name | Type | Description | | :--- | :--- | :--- | | `prompt` | `string` | The input prompt, user request, or human instruction. | | `title` | `string` | The title of the post (Empty in this subset but kept for schema consistency). | | `body` | `string` | The generated story, assistant response, or target text. | ## Example Data **Prompt:** > After being dead for several months, you awake, with no memories of where you are, or why you are dead. Terrified, you break out of your casket and out of the ground, only to turn around and read the name on the gravestone. **Body:** > *I can't breathe.* I take another rapid breath, trying to obtain enough oxygen to quell my throbbing lungs. Every breath, however, feels like it's tearing a hole through my body, taking my calm with it. I open my eyes to be met with darkness. It's taunting. "Hello?" *breath* "Help!" *breath* "... ## Curation Process This dataset was processed via an automated Python pipeline using the `datasets` library. The pipeline unpacks list-of-dictionaries formatted as `{'from': 'human', 'value': '...'}` and standardizes them into flat rows. Completely empty prompts/bodies were filtered out to ensure high data quality.
提供机构:
rafy2342
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作