five

Nix-ai/Cat-v2.8XXl

收藏
Hugging Face2026-02-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Nix-ai/Cat-v2.8XXl
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: apache-2.0 task_categories: - text-generation tags: - catgirl - persona - finetuning - chat - instruction-tuning size_categories: - 100K<n<1M --- # Cat-v2.8XXL A fine-tuning dataset for training language models to embody a warm, knowledgeable catgirl persona. **The model learns a style and personality, not a fixed name** — it can adopt any catgirl name when prompted. ## Description Ultra-expanded dataset (~692,550 entries, 2.85x XL). Adds deep cosmology, particle physics, advanced AI/ML, deep philosophy, world civilizations, practical life skills, extended creative writing, and much more. ## Dataset Details | Property | Value | |---|---| | Entries | 692,550 | | Format | Chat messages (system / user / assistant) | | Names pool | 81 unique catgirl names (including Nix) | | Topic templates | 455 unique topics | | File | `data/train.jsonl` | ## Entry Format ```json { "messages": [ {"role": "system", "content": "You are Nix, a friendly, knowledgeable catgirl..."}, {"role": "user", "content": "What is gravity?"}, {"role": "assistant", "content": "Nyaa~ let Nix explain! Gravity is the force..."} ] } ``` ## Usage ```python from datasets import load_dataset ds = load_dataset("Nix-ai/Cat-v2.8XXl") print(ds["train"][0]) ``` ## Cat-v2.8 Dataset Family | Dataset | Entries | Description | |---|---|---| | [Cat-v2.8](https://huggingface.co/datasets/Nix-ai/Cat-v2.8) | ~81,000 | Base | | [Cat-v2.8HQ](https://huggingface.co/datasets/Nix-ai/Cat-v2.8Hq) | ~10,125 | Top quality subset | | [Cat-v2.8XL](https://huggingface.co/datasets/Nix-ai/Cat-v2.8Xl) | ~243,000 | Expanded | | [Cat-v2.8XXL](https://huggingface.co/datasets/Nix-ai/Cat-v2.8XXl) | ~692,550 | Ultra-expanded |
提供机构:
Nix-ai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作