five

azharmo/tamil-orca

收藏
Hugging Face2024-03-25 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/azharmo/tamil-orca
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - text-generation language: - ta tags: - orca - reasoning - tamil - generation pretty_name: Tamil-orca size_categories: - 10K<n<100K --- # Tamil Orca-Style Dataset ## Overview This repository hosts the Tamil Orca-style dataset, meticulously curated to enhance the reasoning capabilities of large language models in Tamil. The dataset is a fusion of translations and responses generated by GPT-4 and Gemini models. - **Content**: The dataset contains three columns - 'Instruction', 'Query', and 'Answer'. - **Purpose**: It's designed to significantly improve the reasoning capability of AI language models in Tamil. - **Usage**: If you utilize this dataset or any component of the Tamil-orca datasets in your research, please acknowledge it in your citations. ## Upcoming Research - Research based on this dataset is underway and will be published soon, contributing valuable insights into language model training and performance in Tamil. ## Credits Get to know the creators behind this innovative dataset/model and follow their contributions to the field: - **Creator**: Mohamed Azharudeen - **LinkedIn**: [Mohamed Azharudeen](https://www.linkedin.com/in/mohamed-azharudeen/)
提供机构:
azharmo
原始信息汇总

Tamil Orca-Style Dataset 概述

数据集基本信息

  • 许可证: Apache-2.0
  • 任务类别: 文本生成
  • 语言: 泰米尔语
  • 标签: orca, 推理, 泰米尔语, 生成
  • 数据集大小: 10K<n<100K
  • 美观名称: Tamil-orca

数据集内容与目的

  • 内容: 包含三个列 - Instruction, Query, 和 Answer。
  • 目的: 旨在显著提升泰米尔语AI语言模型的推理能力。

使用说明

  • 若在研究中使用此数据集或其任何部分,请在引用中予以承认。

数据集创建者

  • 创建者: Mohamed Azharudeen
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作