azharmo/tamil-orca

Name: azharmo/tamil-orca
Creator: azharmo
Published: 2024-03-25 07:27:01
License: 暂无描述

Hugging Face2024-03-25 更新2024-06-11 收录

下载链接：

https://hf-mirror.com/datasets/azharmo/tamil-orca

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - text-generation language: - ta tags: - orca - reasoning - tamil - generation pretty_name: Tamil-orca size_categories: - 10K<n<100K --- # Tamil Orca-Style Dataset ## Overview This repository hosts the Tamil Orca-style dataset, meticulously curated to enhance the reasoning capabilities of large language models in Tamil. The dataset is a fusion of translations and responses generated by GPT-4 and Gemini models. - **Content**: The dataset contains three columns - 'Instruction', 'Query', and 'Answer'. - **Purpose**: It's designed to significantly improve the reasoning capability of AI language models in Tamil. - **Usage**: If you utilize this dataset or any component of the Tamil-orca datasets in your research, please acknowledge it in your citations. ## Upcoming Research - Research based on this dataset is underway and will be published soon, contributing valuable insights into language model training and performance in Tamil. ## Credits Get to know the creators behind this innovative dataset/model and follow their contributions to the field: - **Creator**: Mohamed Azharudeen - **LinkedIn**: [Mohamed Azharudeen](https://www.linkedin.com/in/mohamed-azharudeen/)

提供机构：

azharmo

原始信息汇总

Tamil Orca-Style Dataset 概述

数据集基本信息

许可证: Apache-2.0
任务类别: 文本生成
语言: 泰米尔语
标签: orca, 推理, 泰米尔语, 生成
数据集大小: 10K<n<100K
美观名称: Tamil-orca

数据集内容与目的

内容: 包含三个列 - Instruction, Query, 和 Answer。
目的: 旨在显著提升泰米尔语AI语言模型的推理能力。

使用说明

若在研究中使用此数据集或其任何部分，请在引用中予以承认。

数据集创建者

创建者: Mohamed Azharudeen

5,000+

优质数据集

54 个

任务类型

进入经典数据集