Eurolingua/Toucan
收藏Hugging Face2026-03-09 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Eurolingua/Toucan
下载链接
链接失效反馈官方服务:
资源简介:
# Conversion Artifacts
## Source
- Dataset: [Agent-Ark/Toucan-1.5M](https://huggingface.co/datasets/Agent-Ark/Toucan-1.5M)
- Dataset config: `SFT`
- Dataset split: `train`
- Dataset revision: `main`
## Template
- Model/tokenizer: `nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16`
- Model revision: `main`
- Chat template link: https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16/blob/main/tokenizer_config.json
- Saved chat template file: `chat_template.jinja`
## Files
- Converted dataset JSONL: `toucan_train_converted.jsonl`
- Statistics JSON: `stats.json`
- Chat template file: `chat_template.jinja`
- This README: `README.md`
## Statistics
```json
{
"converted": 119287,
"errors": 0,
"model_reference_matches": 224,
"rows_with_model_reference": 192,
"skipped": 0,
"total": 119287
}
```
提供机构:
Eurolingua



