LucidityAI/PIPKIN-30K-Creative
收藏Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/LucidityAI/PIPKIN-30K-Creative
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
language:
- en
pretty_name: PIPPA 30K
---
# PIPKIN 30K Creative
The PIPKIN 30K Creative dataset is the first dataset in a series of evolving, in-the-wild creative datasets.
This dataset is inspired by [Pygmalion's PIPPA dataset](https://huggingface.co/datasets/PygmalionAI/PIPPA) from 2023.
Data is collected by exchanging anonymous data for OSS model usage (GLM-5, GLM 4.7, DeepSeek V3.1, Qwen 3.5 397B, Kimi K2.5, etc).
This data shows 200+ million tokens of chat data.
The average amount of input tokens is ~10k.
The average amount of completion tokens is ~0.8k-1k.
There are 28k examples of unique conversations.
提供机构:
LucidityAI



