MIldoc/rus_science_for_gpt_oss_20b
收藏Hugging Face2025-12-14 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/MIldoc/rus_science_for_gpt_oss_20b
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个俄语科学学术辅助数据集,包含约32,272个JSONL格式的示例,用于训练和微调大型语言模型(LLM)。数据集内容涵盖科学文本、学术风格、表格/方法描述、引言、解释和重述等。每个示例包含完整的对话上下文和助手的回答。数据集适用于科学写作、数据分析、引言和总结的生成任务。
This dataset is a Russian-language scientific and academic assistant dataset containing approximately 32,272 examples in JSONL format, designed for training and fine-tuning large language models (LLMs). The dataset covers scientific texts, academic style, table/method descriptions, introductions, explanations, and paraphrasing. Each example includes a complete dialogue context and the assistants response. The dataset is suitable for tasks such as scientific writing, data analysis, and generating introductions and summaries.
提供机构:
MIldoc



