ankitdhiman/nemotron-post-training-dataset-v1-processed
收藏Hugging Face2025-10-14 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ankitdhiman/nemotron-post-training-dataset-v1-processed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是NVIDIA Nemotron后训练数据集v1的子集,包含了chat和tools两个部分,去除了thinking tokens和用户消息。每个部分都包含有多个字段,如uuid、license、generator等,以及嵌套的消息和工具调用结构。数据集分为训练集,提供了相应的字节数和示例数。
This dataset is a subset of the NVIDIA Nemotron Post-Training Dataset v1, including chat and tools sections, with thinking tokens and user messages removed. Each section contains multiple fields such as uuid, license, generator, etc., and nested message and tool call structures. The dataset is split into a training set, with provided byte size and number of examples.
提供机构:
ankitdhiman



