kth8/user_prompt_domain_classification-500000x
收藏Hugging Face2026-04-04 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/kth8/user_prompt_domain_classification-500000x
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- text-classification
language:
- en
size_categories:
- 100K<n<1M
---
500,000 users prompts classified into domain. Classification performed by [openai/gpt-oss-120b](https://huggingface.co/openai/gpt-oss-120b) with reasoning set to `medium` and `temperature=0`, `top_p=1`.
Prompts sourced and randomized from various repos including:
- [Roman1111111/coding-prompts](https://huggingface.co/datasets/Roman1111111/coding-prompts)
- [kth8/user-prompts-1M](https://huggingface.co/datasets/kth8/user-prompts-1M)
- [wop/just-user-prompts](https://huggingface.co/datasets/wop/just-user-prompts)
- [trl-lib/DeepMath-103K](https://huggingface.co/datasets/trl-lib/DeepMath-103K)
- [ianncity/General-Distillation-Prompts-1M](https://huggingface.co/datasets/ianncity/General-Distillation-Prompts-1M)
- [ianncity/VIBE-Prompts-500000x](https://huggingface.co/datasets/ianncity/VIBE-Prompts-500000x)
- [ianncity/science-prompts-100k](https://huggingface.co/datasets/ianncity/science-prompts-100k)
- [m-a-p/SuperGPQA](https://huggingface.co/datasets/m-a-p/SuperGPQA)
Total completion tokens: 70 million
提供机构:
kth8



