
YCWTG/Better-Ruozhiba-ChatML

Source: Hugging Face. Updated 2026-04-06. Indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/YCWTG/Better-Ruozhiba-ChatML
Resource description:
---
language:
- zh
tags:
- chatml
- sft
- chinese
- dialogue
task_categories:
- text-generation
size_categories:
- 1K<n<10K
pretty_name: Better-Ruozhiba-ChatML
license: other
---

# Better-Ruozhiba-ChatML

This dataset is a ChatML-style conversion of `FunnySaltyFish/Better-Ruozhiba`, prepared for supervised fine-tuning (SFT).

## Data Source

- Upstream dataset: `FunnySaltyFish/Better-Ruozhiba`
- This repo keeps the same content and rewrites each sample into a `messages` conversation format.

## Format

Main file: `train.json`

Each record has the following schema:

```json
{
  "messages": [
    {"role": "user", "content": "..."},
    {"role": "assistant", "content": "..."}
  ]
}
```

## Statistics

- Samples: 1,475
- Language: Chinese (`zh`)
- Average user message length: ~25 characters
- Average assistant message length: ~84 characters

## Intended Use (SFT)

- Single-turn chat SFT
- Instruction-following / conversational tuning with ChatML-compatible pipelines

## Loading Example

```python
from datasets import load_dataset

ds = load_dataset("YCWTG/Better-Ruozhiba-ChatML", split="train")
print(ds[0]["messages"])
```

## Notes

- This dataset is for research and model training purposes.
- Please check and follow the original upstream repository's terms, attribution requirements, and any license constraints before commercial use.
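## Rendering Example (sketch)

The `messages` schema described under Format can be flattened into ChatML text before tokenization. The following is a minimal sketch, not part of the upstream repository: the helper names `is_valid_record` and `to_chatml` are hypothetical, the `<|im_start|>` / `<|im_end|>` delimiters are the commonly used ChatML markers and are assumed to match your target model, and the "exactly one user turn followed by one assistant turn" check is an assumption based on the schema and the single-turn intended use stated above.

```python
# Commonly used ChatML delimiters (assumption: your tokenizer uses the same ones).
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def is_valid_record(record: dict) -> bool:
    """Return True if the record is one user turn followed by one assistant turn."""
    messages = record.get("messages")
    if not isinstance(messages, list) or len(messages) != 2:
        return False
    # Non-dict entries are skipped here, so the role list will not match below.
    roles = [m.get("role") for m in messages if isinstance(m, dict)]
    if roles != ["user", "assistant"]:
        return False
    return all(isinstance(m.get("content"), str) for m in messages)


def to_chatml(record: dict) -> str:
    """Render a record's messages as a single ChatML-formatted string."""
    return "".join(
        f"{IM_START}{m['role']}\n{m['content']}{IM_END}\n"
        for m in record["messages"]
    )


# Hypothetical sample in the documented schema (not taken from the dataset).
sample = {
    "messages": [
        {"role": "user", "content": "hi"},
        {"role": "assistant", "content": "hello"},
    ]
}

if is_valid_record(sample):
    print(to_chatml(sample))
```

In practice, many SFT pipelines apply the tokenizer's own chat template instead of hand-built strings, so treat this only as an illustration of how the `messages` list maps onto ChatML turns.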
Provider:
YCWTG