Losa10/G3P-Finetuning-examples

Name: Losa10/G3P-Finetuning-examples
Creator: Losa10
Published: 2025-12-19 08:33:42
License: 暂无描述

Hugging Face2025-12-19 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/Losa10/G3P-Finetuning-examples

下载链接

链接失效反馈

官方服务：

资源简介：

一个高质量的合成数据集，专为**指令微调**和**推理（CoT）**开发而设计。该数据集使用**Gemini 3 Pro**预览模型生成，专注于技术任务、复杂配置和逻辑分步问题解决。数据集包含两种配置：一种是包含完整推理链（Chain-of-Thought）的复杂任务，另一种是用于直接响应训练的标准指令-输出对。数据集支持英语和俄语，适用于监督微调（SFT）、推理能力训练和多语言对齐等应用场景。

A high-quality synthetic dataset designed for **Instruction Fine-Tuning** and **Reasoning (CoT)** development. Generated using the **Gemini 3 Pro** preview model, this dataset focuses on technical tasks, complex configurations, and logical step-by-step problem-solving. The dataset is split into two distinct configurations: one includes full reasoning chains (Chain-of-Thought) for complex tasks, and the other provides standard instruction-output pairs for direct response training. It supports both English and Russian languages and is intended for use cases such as Supervised Fine-Tuning (SFT), Reasoning Capability enhancement, and Multilingual Alignment.

提供机构：

Losa10

5,000+

优质数据集

54 个

任务类型

进入经典数据集