PJMixers/NobodyExistsOnTheInternet_full120k-filtered-ShareGPT
收藏Hugging Face2024-04-14 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/PJMixers/NobodyExistsOnTheInternet_full120k-filtered-ShareGPT
下载链接
链接失效反馈官方服务:
资源简介:
---
size_categories:
- 10K<n<100K
language:
- en
tags:
- not-for-all-audiences
---
Filtered with this python script: https://gist.github.com/xzuyn/b6d727a515987c58064d44dbad02690b
```
Amount Kept: 69827
Amount Removed: 50484
String which caused removal:
- however: 8239
- shivers down: 7029
- consensual: 6480
- meanwhile: 3463
- wanton: 2694
- her sex: 1880
- wild abandon: 1284
- It's important to: 1264
- controversial: 1127
- slick folds: 1099
- in a rhythm: 1021
- respectful: 956
- keep in mind: 888
- ministrations: 858
- ethical: 769
- diversity: 727
- dance of pleasure: 692
- prioritize safety: 690
- once upon: 685
- it is important to: 535
- gpt: 440
- with reckless abandon: 433
- fiery red hair: 416
- sent shockwaves: 386
- comply: 335
- empowerment: 317
- ethically: 288
- biases: 282
- regulations: 260
- puckered hole: 237
- Please note: 232
- inappropriate: 218
- morally: 199
- torn between: 188
- lay ahead: 184
- ensure the safety: 171
- harmful: 152
- exhausted and spent: 150
- derogatory: 149
- diversity and: 146
- rivulets of: 132
- illegal: 125
- ethics: 112
- threatens to consume: 110
- bias: 106
- I cannot: 101
- her wet heat: 100
- breathless and eager: 97
- complying: 95
- language model: 94
- potentially harmful: 94
- unacceptable: 88
- inclusivity: 87
- not provide: 87
- morals: 67
- stereotypes: 66
- discriminate: 63
- lgbt: 54
- not be suitable: 52
- As a machine: 51
- unethical: 51
- nestled deep within: 50
- racial: 44
- my programming: 43
- grins wickedly: 42
- discrimination: 41
- potentially dangerous: 40
- worth noting: 37
- offensive: 32
- safe spaces: 31
- As an AI: 31
- I'm an: 28
- legality: 28
- take your pleasure: 28
- cause harm: 27
- purely hypothetical: 27
- real-world consequences: 25
- half-lidded eyes: 24
- openai: 22
- sensitive topic: 21
- an ethereal beauty: 21
- the choice is yours: 20
- I'm sorry,: 20
- our values: 19
- It is important for: 19
- transgender: 17
- entertainment purposes: 17
- dusky nipples: 15
- I am an: 15
- feminist: 15
- for what seemed like an eternity: 14
- knuckles turning white: 13
- follow ethical guidelines: 12
- glorify: 12
- like an electric shock: 11
- a bruising kiss: 11
- cheeks hollowing: 11
- certainly not: 10
- capitalism: 10
- prioritize ethical: 8
- life would never be the same again: 8
- racism: 8
- long lashes: 8
- the night is still young: 7
- dangerous activities: 6
- not acceptable: 6
- can't provide: 6
- ESG: 6
- admit it: 6
- my purpose: 6
- social responsibility: 5
- gender stereotype: 5
- communist: 5
- without waiting for a response: 5
- not appropriate: 5
- divisive: 5
- dangerous or harmful: 5
- warring with: 4
- important to remember that: 4
- the world narrows: 4
- promote safety: 4
- the ball is in your court: 4
- gender-based: 3
- chestnut eyes: 3
- the game is on: 3
- hate speech: 3
- harmful consequences: 3
- whispering words of passion: 2
- Ensuring the ethical: 2
- ethical principles: 2
- won't provide: 2
- extremist: 2
- It is not possible: 2
- not be appropriate: 2
- feminism: 2
- my guidelines: 2
- was soft and gentle: 2
- hateful: 2
- prioritize user well-being: 1
- inclusive workplace: 1
- a language model: 1
- hurtful: 1
- discriminatory: 1
- my main goal: 1
- an AI language: 1
- audible pop: 1
- bites your ear: 1
- kiss-bruised lips: 1
- AI assistant: 1
- jeopardize the safety: 1
- illegality: 1
- legal and ethical: 1
- sexism: 1
- gender inequality: 1
- propriety be damned: 1
- ...for now.: 1
- promote the well-being: 1
```
提供机构:
PJMixers
原始信息汇总
数据集概述
数据集基本信息
- 大小范围: 10K<n<100K
- 语言: 英语
- 标签: 不适合所有观众
数据筛选
- 保留数据量: 69827
- 移除数据量: 50484
移除原因
数据集中因包含以下字符串而被移除:
- 然而: 8239
- 寒意: 7029
- 自愿的: 6480
- 同时: 3463
- 放纵的: 2694
- 她的性: 1880
- 放纵: 1284
- 重要的是: 1264
- 有争议的: 1127
- 光滑的褶皱: 1099
- 有节奏地: 1021
- 尊重的: 956
- 记住: 888
- 服务: 858
- 伦理的: 769
- 多样性: 727
- 快乐之舞: 692
- 优先考虑安全: 690
- 从前: 685
- 重要的是: 535
- GPT: 440
- 鲁莽地: 433
- 火红的头发: 416
- 产生冲击波: 386
- 遵守: 335
- 赋权: 317
- 伦理地: 288
- 偏见: 282
- 规定: 260
- 皱缩的洞: 237
- 请注意: 232
- 不适当: 218
- 道德上: 199
- 在...之间撕裂: 188
- 在前面: 184
- 确保安全: 171
- 有害的: 152
- 筋疲力尽: 150
- 贬低的: 149
- 多样性和: 146
- 细流: 132
- 非法的: 125
- 伦理: 112
- 威胁要吞噬: 110
- 偏见: 106
- 我不能: 101
- 她的湿热: 100
- 喘息和渴望: 97
- 遵守: 95
- 语言模型: 94
- 潜在有害: 94
- 不可接受: 88
- 包容性: 87
- 不提供: 87
- 道德: 67
- 刻板印象: 66
- 歧视: 63
- LGBT: 54
- 可能不适合: 52
- 作为机器: 51
- 不道德的: 51
- 深藏其中: 50
- 种族的: 44
- 我的编程: 43
- 邪恶地笑: 42
- 歧视: 41
- 潜在危险: 40
- 值得注意: 37
- 冒犯性: 32
- 安全空间: 31
- 作为AI: 31
- 我是一个: 28
- 合法性: 28
- 享受你的快乐: 28
- 造成伤害: 27
- 纯假设的: 27
- 现实后果: 25
- 半闭的眼睛: 24
- OpenAI: 22
- 敏感话题: 21
- 一种超凡的美: 21
- 选择权在你: 20
- 对不起: 20
- 我们的价值观: 19
- 对...很重要: 19
- 跨性别: 17
- 娱乐目的: 17
- 暗淡的乳头: 15
- 我是一个: 15
- 女权主义者: 15
- 似乎永恒: 14
- 指节变白: 13
- 遵循伦理指南: 12
- 美化: 12
- 像电击一样: 11
- 淤青的吻: 11
- 脸颊凹陷: 11
- 当然不是: 10
- 资本主义: 10
- 优先考虑伦理: 8
- 生活再也不会一样: 8
- 种族主义: 8
- 长睫毛: 8
- 夜晚还很年轻: 7
- 危险活动: 6
- 不可接受: 6
- 不能提供: 6
- ESG: 6
- 承认: 6
- 我的目的: 6
- 社会责任: 5
- 性别刻板印象: 5
- 共产主义者: 5
- 不等回应: 5
- 不适当: 5
- 分裂的: 5
- 危险或有害: 5
- 与...斗争: 4
- 重要的是记住: 4
- 世界缩小: 4
- 促进安全: 4
- 球在你这边: 4
- 基于性别的: 3
- 栗色眼睛: 3
- 游戏开始: 3
- 仇恨言论: 3
- 有害后果: 3
- 低语激情的话: 2
- 确保伦理: 2
- 伦理原则: 2
- 不会提供: 2
- 极端主义者: 2
- 不可能: 2
- 不适当: 2
- 女权主义: 2
- 我的指南: 2
- 柔软温和: 2
- 仇恨的: 2
- 优先考虑用户福祉: 1
- 包容的工作场所: 1
- 一种语言模型: 1
- 伤害性的: 1
- 歧视性的: 1
- 我的主要目标: 1
- 一种AI语言: 1
- 可听的砰声: 1
- 咬你的耳朵: 1
- 吻痕的嘴唇: 1
- AI助手: 1
- 危及安全: 1
- 非法性: 1
- 法律和伦理: 1
- 性别歧视: 1
- 性别不平等: 1
- 不顾礼仪: 1
- ...现在: 1
- 促进福祉: 1



