azharmo/tamil-orca
收藏Hugging Face2024-03-25 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/azharmo/tamil-orca
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- text-generation
language:
- ta
tags:
- orca
- reasoning
- tamil
- generation
pretty_name: Tamil-orca
size_categories:
- 10K<n<100K
---
# Tamil Orca-Style Dataset
## Overview
This repository hosts the Tamil Orca-style dataset, meticulously curated to enhance the reasoning capabilities of large language models in Tamil. The dataset is a fusion of translations and responses generated by GPT-4 and Gemini models.
- **Content**: The dataset contains three columns - 'Instruction', 'Query', and 'Answer'.
- **Purpose**: It's designed to significantly improve the reasoning capability of AI language models in Tamil.
- **Usage**: If you utilize this dataset or any component of the Tamil-orca datasets in your research, please acknowledge it in your citations.
## Upcoming Research
- Research based on this dataset is underway and will be published soon, contributing valuable insights into language model training and performance in Tamil.
## Credits
Get to know the creators behind this innovative dataset/model and follow their contributions to the field:
- **Creator**: Mohamed Azharudeen
- **LinkedIn**: [Mohamed Azharudeen](https://www.linkedin.com/in/mohamed-azharudeen/)
提供机构:
azharmo
原始信息汇总
Tamil Orca-Style Dataset 概述
数据集基本信息
- 许可证: Apache-2.0
- 任务类别: 文本生成
- 语言: 泰米尔语
- 标签: orca, 推理, 泰米尔语, 生成
- 数据集大小: 10K<n<100K
- 美观名称: Tamil-orca
数据集内容与目的
- 内容: 包含三个列 - Instruction, Query, 和 Answer。
- 目的: 旨在显著提升泰米尔语AI语言模型的推理能力。
使用说明
- 若在研究中使用此数据集或其任何部分,请在引用中予以承认。
数据集创建者
- 创建者: Mohamed Azharudeen



