almanach/topxgen-llama-4-scout-SBYS
Source: Hugging Face. Updated 2025-10-15. Indexed 2025-10-18.
Download link:
https://hf-mirror.com/datasets/almanach/topxgen-llama-4-scout-SBYS
Description:
A synthetic machine translation dataset for fine-tuning large language models to generate intermediate reasoning tokens before the final translation. Each record contains the source sentence, the target translation, the source and target languages, and the output of several stages of the translation process: pre-drafting research, draft translation, refined translation, proofread translation, and a final "better translation". The data is used to train models to produce both the reasoning and the translation.
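The record layout described above could be turned into a fine-tuning example roughly as follows. This is only a sketch: the listing names the stages but not the actual column keys, so every field name below (`predrafting_research`, `drafting`, `refinement`, `proofreading`, `better_translation`), the `<reasoning>` delimiters, and the toy record are assumptions made for illustration, not the dataset's confirmed schema or contents.

```python
# Hypothetical column keys; the real dataset may name these differently.
STAGES = ["predrafting_research", "drafting", "refinement", "proofreading"]


def build_training_text(record: dict) -> str:
    """Join the intermediate stages as reasoning, followed by the final translation."""
    header = (
        f"Translate from {record['source_language']} "
        f"to {record['target_language']}:\n{record['source']}"
    )
    # Keep only the stages that are present and non-empty in this record.
    reasoning = "\n".join(
        f"[{stage}] {record[stage]}" for stage in STAGES if record.get(stage)
    )
    # The <reasoning> delimiters are an illustrative choice, not a dataset convention.
    return (
        f"{header}\n<reasoning>\n{reasoning}\n</reasoning>\n"
        f"{record['better_translation']}"
    )


# Toy record invented for this sketch; it is not taken from the dataset.
example = {
    "source": "Good morning.",
    "target": "Bonjour.",
    "source_language": "English",
    "target_language": "French",
    "predrafting_research": "Short greeting; no idioms to resolve.",
    "drafting": "Bon matin.",
    "refinement": "Bonjour.",
    "proofreading": "Bonjour.",
    "better_translation": "Bonjour.",
}

print(build_training_text(example))
```

Placing the stages before the final translation reflects the stated goal of training a model to emit its reasoning first and the translation last; a real pipeline would adapt the keys and delimiters to the actual columns and chat template.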
Provider:
almanach



