five

LucidityAI/PIPKIN-30K-Creative

收藏
Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/LucidityAI/PIPKIN-30K-Creative
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit language: - en pretty_name: PIPPA 30K --- # PIPKIN 30K Creative The PIPKIN 30K Creative dataset is the first dataset in a series of evolving, in-the-wild creative datasets. This dataset is inspired by [Pygmalion's PIPPA dataset](https://huggingface.co/datasets/PygmalionAI/PIPPA) from 2023. Data is collected by exchanging anonymous data for OSS model usage (GLM-5, GLM 4.7, DeepSeek V3.1, Qwen 3.5 397B, Kimi K2.5, etc). This data shows 200+ million tokens of chat data. The average amount of input tokens is ~10k. The average amount of completion tokens is ~0.8k-1k. There are 28k examples of unique conversations.
提供机构:
LucidityAI
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作