laion/CoderForge-Preview-v3-316
收藏Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/laion/CoderForge-Preview-v3-316
下载链接
链接失效反馈官方服务:
资源简介:
laion/CoderForge-Preview-v3-316数据集是togethercomputer/CoderForge-Preview数据集的一个子集,具体是trajectories-tokenized_qwencoder子集。它包含316行数据,原始数据有155,144行,分布在4个slug中。数据格式是预处理的Qwen3原生数据,每行包含多个字段如input_ids、attention_mask、labels等。数据集用于文本生成任务,支持axolotl框架的使用。
laion/CoderForge-Preview-v3-316 is a row-subset of the pre-tokenized trajectories in togethercomputer/CoderForge-Preview (trajectories-tokenized_qwencoder subset). It contains 316 rows (source: 155,144 across 4 slugs). The format is native pre-tokenized data for Qwen3, with per-row columns including input_ids, attention_mask, labels, etc. The dataset is used for text-generation tasks and supports the axolotl framework.
提供机构:
laion



