five

Deepthink-Reasoning-Tamil

收藏
魔搭社区2026-01-06 更新2025-02-15 收录
下载链接:
https://modelscope.cn/datasets/prithivMLmods/Deepthink-Reasoning-Tamil
下载链接
链接失效反馈
官方服务:
资源简介:
# Deepthink-Reasoning-Tamil Dataset The **Deepthink-Reasoning-Tamil** dataset is a multilingual dataset that includes both **Tamil** and **Tanglish**. It is designed with a **dynamic instruction set**, making it adaptable for various **reasoning-based problem-solving tasks**. ### Key Features: - **Multilingual Support**: Includes **Tamil** and **Tanglish** for broader accessibility. - **Dynamic Instructions**: Designed to adapt to **various problem-solving tasks**. - **Advanced Translation Pipeline**: The dataset was processed using a **custom language pipeline** built with **Gemini Flash Experimental 2.0 Thinker**, ensuring high-quality **instruction translation**. - **Synthetic Reasoning Framework**: Inspired by the **thinking structure of QWrens QWQ models**, enhancing its synthetic reasoning capabilities. - **Conceptual Alignment**: Draws insights from **LIMO (Less is More for Reasoning)** for efficient reasoning approaches. ### Dataset References: The dataset is derived from and extends the following sources: - [Deepthink-Reasoning](https://huggingface.co/datasets/prithivMLmods/Deepthink-Reasoning) - [Deepthink-Reasoning-Ins](https://huggingface.co/datasets/prithivMLmods/Deepthink-Reasoning-Ins) This dataset serves as a robust foundation for **multilingual reasoning models**, **instruction tuning**, and **problem-solving tasks**, particularly in Tamil and Tanglish.

# Deepthink-Reasoning-Tamil 数据集 **Deepthink-Reasoning-Tamil** 数据集是一款同时涵盖泰米尔语(Tamil)与坦格利什语(Tanglish)的多语言数据集。该数据集采用动态指令集(dynamic instruction set)进行设计,可适配各类基于推理的问题求解任务。 ### 核心特性: - **多语言支持**:覆盖泰米尔语与坦格利什语,适配更广泛的使用场景。 - **动态指令设计**:可灵活适配多种问题求解任务。 - **高级翻译流水线**:本数据集通过基于Gemini Flash Experimental 2.0 Thinker构建的定制化语言流水线完成处理,确保指令翻译的高质量水准。 - **合成推理框架**:灵感源自QWrens QWQ模型(QWrens QWQ models)的思维结构,可有效增强其合成推理能力。 - **概念对齐**:借鉴LIMO(Less is More for Reasoning)的研究范式,采用高效的推理实现路径。 ### 数据集参考来源: 本数据集衍生自并扩展了以下开源资源: - [Deepthink-Reasoning](https://huggingface.co/datasets/prithivMLmods/Deepthink-Reasoning) - [Deepthink-Reasoning-Ins](https://huggingface.co/datasets/prithivMLmods/Deepthink-Reasoning-Ins) 本数据集可作为多语言推理模型、指令微调(instruction tuning)以及问题求解任务的坚实基础,尤其适用于泰米尔语与坦格利什语相关场景。
提供机构:
maas
创建时间:
2025-02-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作