Edgerunners/Phoebus-127k-labels
收藏Hugging Face2024-05-31 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/Edgerunners/Phoebus-127k-labels
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-4.0
---
Phoebus-127k but with labels added so users can connect chapters together.
raw human created erotica stories, needs filtering
things that need to be filtered:
1. product spambots, the website was being spammed a few times (usually with html tags)
2. warning only pages ("Warning" and nothing else)
3. edits (authors adding editorial history footnotes)
4. patreon and alike callouts (author asking for donations)
5. author notes, summaries, tagging
---
1. The Dataset is provided ""AS IS"" and ""AS AVAILABLE"" without warranty of any kind, express or implied, including but not limited to warranties of merchantability, fitness for a particular purpose, title, or non-infringement.
2. The Provider disclaims all liability for any damages or losses resulting from the use or misuse of the Dataset, including but not limited to any damages or losses arising from the use of the Dataset for purposes other than those intended by the Provider.
3. The Provider does not endorse or condone the use of the Dataset for any purpose that violates applicable laws, regulations, or ethical standards.
4. The Provider does not warrant that the Dataset will meet your specific requirements or that it will be error-free or that it will function without interruption.
5. You assume all risks associated with the use of the Dataset, including but not limited to any loss of data, loss of business, or damage to your reputation.
提供机构:
Edgerunners
原始信息汇总
数据集概述
数据集名称
Phoebus-127k
数据集特点
- 包含标签,便于用户连接章节。
- 原始数据为人类创作的情色故事。
- 需要进行内容过滤。
需要过滤的内容
- 产品垃圾邮件(通常包含HTML标签)。
- 警告页面(仅包含“Warning”字样)。
- 编辑历史脚注。
- Patreon等捐赠请求。
- 作者笔记、摘要、标签。
许可证
cc-by-nc-4.0



