DataHunterID/OpenO1-SFT-Indo
收藏Hugging Face2024-12-07 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/DataHunterID/OpenO1-SFT-Indo
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是OpenO1-SFT数据集的印尼语翻译版本,旨在通过SFT(监督微调)技术增强语言模型在印尼语中生成连贯和逻辑推理序列的能力。数据集的设计目标是帮助模型学习生成详细和结构化的推理步骤,从而提升其在复杂推理任务中的表现。当前版本包含约1,000条记录,目标总记录数为77,685条。数据集使用<Thought> </Thought>和<Output> </Output>分隔符来区分思维过程和最终答案。翻译过程使用了Gemini Pro 1.5 API、GPT-4o和GPT-4o-Mini等先进AI模型,但可能存在翻译不准确的情况,建议用户在使用前进行额外验证。
This dataset is an Indonesian translation of the OpenO1-SFT dataset, designed to enhance the ability of language models to generate coherent and logical reasoning sequences in Indonesian using SFT (Supervised Fine-Tuning). The dataset aims to help models learn to produce detailed and structured reasoning steps, thereby improving their performance on complex reasoning tasks. The current version contains approximately 1,000 entries, with a total target of 77,685 entries. The dataset uses <Thought> </Thought> and <Output> </Output> delimiters to separate the thinking process from the final answer. The translation process employs advanced AI models such as Gemini Pro 1.5 API, GPT-4o, and GPT-4o-Mini, but may contain inaccuracies, so users are advised to perform additional validation before use.
提供机构:
DataHunterID



