Nix-ai/Cat-v2.8XXl
Source: Hugging Face | Updated: 2026-02-26 | Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/Nix-ai/Cat-v2.8XXl
Resource overview:
---
language:
- en
license: apache-2.0
task_categories:
- text-generation
tags:
- catgirl
- persona
- finetuning
- chat
- instruction-tuning
size_categories:
- 100K<n<1M
---
# Cat-v2.8XXL
A fine-tuning dataset for training language models to embody a warm, knowledgeable catgirl persona.
**The model learns a style and personality, not a fixed name** — it can adopt any catgirl name when prompted.
## Description
Ultra-expanded dataset (~692,550 entries, about 2.85x the size of Cat-v2.8XL). It adds deep cosmology, particle physics, advanced AI/ML, deep philosophy, world civilizations, practical life skills, extended creative writing, and other topics.
## Dataset Details
| Property | Value |
|---|---|
| Entries | 692,550 |
| Format | Chat messages (system / user / assistant) |
| Names pool | 81 unique catgirl names (including Nix) |
| Topic templates | 455 unique topics |
| File | `data/train.jsonl` |
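Since the data ships as a single JSONL file, it can also be read without the `datasets` library. The following is a minimal sketch, assuming one JSON object per line; the helper name `iter_jsonl` and the two in-memory rows are made up for illustration and stand in for the real `data/train.jsonl`.

```python
import io
import json
from typing import Iterable, Iterator


def iter_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines instead of failing on them
            yield json.loads(line)


# Stand-in for open("data/train.jsonl", encoding="utf-8").
# These two rows are illustrative only, not real dataset content.
fake_file = io.StringIO(
    '{"messages": [{"role": "user", "content": "hi"}]}\n'
    '{"messages": [{"role": "user", "content": "bye"}]}\n'
)

entries = list(iter_jsonl(fake_file))
print(len(entries))
```

Reading lazily, line by line, keeps memory use flat, which may matter for a file with several hundred thousand entries.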
## Entry Format
```json
{
"messages": [
{"role": "system", "content": "You are Nix, a friendly, knowledgeable catgirl..."},
{"role": "user", "content": "What is gravity?"},
{"role": "assistant", "content": "Nyaa~ let Nix explain! Gravity is the force..."}
]
}
```
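A quick structural check can catch malformed rows before training. The sketch below assumes every entry has exactly three messages in the order system, user, assistant, as in the example above; the card does not state that this holds for all rows, and `validate_entry` is a hypothetical helper, not part of the dataset.

```python
import json

# Assumption: role order taken from the single example entry shown above.
EXPECTED_ROLES = ["system", "user", "assistant"]


def validate_entry(entry: dict) -> bool:
    """Return True if `entry` matches the three-message layout of the example."""
    messages = entry.get("messages")
    if not isinstance(messages, list):
        return False
    if not all(isinstance(m, dict) for m in messages):
        return False
    if [m.get("role") for m in messages] != EXPECTED_ROLES:
        return False
    # Every message needs a non-empty string as its content.
    return all(isinstance(m.get("content"), str) and m["content"] for m in messages)


sample = json.loads("""
{
  "messages": [
    {"role": "system", "content": "You are Nix, a friendly, knowledgeable catgirl..."},
    {"role": "user", "content": "What is gravity?"},
    {"role": "assistant", "content": "Nyaa~ let Nix explain! Gravity is the force..."}
  ]
}
""")

print(validate_entry(sample))
```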
## Usage
```python
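from datasets import load_dataset

# Download the dataset from the Hugging Face Hub.
ds = load_dataset("Nix-ai/Cat-v2.8XXl")
# Print the first training example (a dict with a "messages" list).
print(ds["train"][0])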
```
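Some fine-tuning pipelines expect a flat prompt/completion pair rather than a message list. The sketch below shows one possible conversion, assuming each entry has one message per role as in the Entry Format example. The `User:` / `Assistant:` text layout and the helper name `to_prompt_completion` are illustrative choices, not something the dataset prescribes.

```python
def to_prompt_completion(entry: dict) -> dict:
    """Split one chat entry into a prompt (system + user) and a completion (assistant)."""
    # Assumption: exactly one message per role, so a role -> content dict loses nothing.
    by_role = {m["role"]: m["content"] for m in entry["messages"]}
    prompt = f"{by_role['system']}\n\nUser: {by_role['user']}\nAssistant:"
    return {"prompt": prompt, "completion": " " + by_role["assistant"]}


example = {
    "messages": [
        {"role": "system", "content": "You are Nix, a friendly, knowledgeable catgirl..."},
        {"role": "user", "content": "What is gravity?"},
        {"role": "assistant", "content": "Nyaa~ let Nix explain! Gravity is the force..."},
    ]
}

pair = to_prompt_completion(example)
print(pair["prompt"])
print(pair["completion"])
```

With the dataset loaded as above, `ds["train"].map(to_prompt_completion)` would add `prompt` and `completion` columns to every row. Trainers that apply a chat template can usually consume the `messages` list directly, in which case no conversion is needed.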
## Cat-v2.8 Dataset Family
| Dataset | Entries | Description |
|---|---|---|
| [Cat-v2.8](https://huggingface.co/datasets/Nix-ai/Cat-v2.8) | ~81,000 | Base |
| [Cat-v2.8HQ](https://huggingface.co/datasets/Nix-ai/Cat-v2.8Hq) | ~10,125 | Top quality subset |
| [Cat-v2.8XL](https://huggingface.co/datasets/Nix-ai/Cat-v2.8Xl) | ~243,000 | Expanded |
| [Cat-v2.8XXL](https://huggingface.co/datasets/Nix-ai/Cat-v2.8XXl) | ~692,550 | Ultra-expanded |
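The relative sizes of the four variants can be worked out from the table. This short sketch uses the approximate entry counts listed above, so the ratios are approximate as well.

```python
# Approximate entry counts copied from the table above.
sizes = {
    "Cat-v2.8": 81_000,
    "Cat-v2.8HQ": 10_125,
    "Cat-v2.8XL": 243_000,
    "Cat-v2.8XXL": 692_550,
}

base = sizes["Cat-v2.8"]
# Size of each variant relative to the base dataset.
ratios = {name: round(count / base, 3) for name, count in sizes.items()}
# Size of XXL relative to XL, the "2.85x XL" figure from the Description.
xxl_vs_xl = round(sizes["Cat-v2.8XXL"] / sizes["Cat-v2.8XL"], 2)

print(ratios)
print(xxl_vs_xl)
```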
Provider:
Nix-ai



