cognitivecomputations/allenai_tulu-3-sft-mixture-DolphinLabeled
Source: Hugging Face. Updated: 2025-01-06. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/cognitivecomputations/allenai_tulu-3-sft-mixture-DolphinLabeled
Description:
The Tulu 3 SFT Mixture is a multi-source, multilingual dataset used to train the Tulu 3 series of models, which are built on Llama 3.1. It includes samples from sources such as CoCoNot, FLAN v2, and No Robots, and has been preprocessed with the dedupe.py and label.py scripts. The dataset is licensed under ODC-BY-1.0.
Provided by:
cognitivecomputations



