OdiaGenAI/odia_master_data_llama2
收藏Hugging Face2023-09-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/OdiaGenAI/odia_master_data_llama2
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-sa-4.0
task_categories:
- text-generation
language:
- or
pretty_name: odia_master_data_llama2
size_categories:
- 100K<n<1M
---
# Dataset Card for odia_master_data_llama2
## Dataset Description
- **Homepage: https://www.odiagenai.org/**
- **Repository: https://github.com/shantipriyap/OdiaGenAI**
- **Point of Contact: Shantipriya Parida, and Sambit Sekhar**
### Dataset Summary
This dataset is a mix of Odia instruction sets translated from open-source instruction sets and Odia domain knowledge instruction sets.
The Odia instruction sets used are:
* odia_domain_context_train_v1
* dolly-odia-15k
* OdiEnCorp_translation_instructions_25k
* gpt-teacher-roleplay-odia-3k
* Odia_Alpaca_instructions_52k
* hardcode_odia_qa_105
In this dataset Odia instruction, input, and output strings are available.
### Supported Tasks and Leaderboards
Large Language Model (LLM)
### Languages
Odia
## Dataset Structure
JSON
### Data Fields
output (string)
instruction (string)
input (string)
### Licensing Information
This work is licensed under a
[Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License][cc-by-nc-sa].
[![CC BY-NC-SA 4.0][cc-by-nc-sa-image]][cc-by-nc-sa]
[cc-by-nc-sa]: http://creativecommons.org/licenses/by-nc-sa/4.0/
[cc-by-nc-sa-image]: https://licensebuttons.net/l/by-nc-sa/4.0/88x31.png
[cc-by-nc-sa-shield]: https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg
### Citation Information
If you find this repository useful, please consider giving 👏 and citing:
```
@misc{odia_master_data_llama2,
author = {Shantipriya Parida and Sambit Sekhar and Aisha Asif and Subham Pradhan and Guneet Singh Kohli and Swateek Jena},
title = {Large Odia Instruction Set for LlaMA2 Finetuning},
year = {2023},
publisher = {Hugging Face},
journal = {Hugging Face repository},
howpublished = {\url{https://huggingface.co/OdiaGenAI}},
}
```
### Contributions
- Shantipriya Parida (Silo AI, Helsinki, Finland)
- Sambit Sekhar (Odia Generative AI, Bhubaneswar, India)
- Aisha Asif (KIIT, University, Bhubaneswar, India)
- Subham Pradhan (Silicon Institute of Technology, Bhubaneswar, India)
- Guneet Singh Kohli (Thapar Institute of Engineering and Technology, India)
- Swateek Jena (RightSense Inc, USA)
提供机构:
OdiaGenAI
原始信息汇总
数据集卡片 for odia_master_data_llama2
数据集描述
数据集概述
该数据集是混合了从开源指令集翻译的Odia指令集和Odia领域知识指令集。
使用的Odia指令集包括:
- odia_domain_context_train_v1
- dolly-odia-15k
- OdiEnCorp_translation_instructions_25k
- gpt-teacher-roleplay-odia-3k
- Odia_Alpaca_instructions_52k
- hardcode_odia_qa_105
在此数据集中,Odia指令、输入和输出字符串可用。
支持的任务和排行榜
大型语言模型(LLM)
语言
Odia
数据集结构
JSON
数据字段
- output (字符串)
- instruction (字符串)
- input (字符串)
许可信息
该作品根据Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License进行许可。
引用信息
如果您发现此仓库有用,请考虑给予👏并引用:
@misc{odia_master_data_llama2, author = {Shantipriya Parida and Sambit Sekhar and Aisha Asif and Subham Pradhan and Guneet Singh Kohli and Swateek Jena}, title = {Large Odia Instruction Set for LlaMA2 Finetuning}, year = {2023}, publisher = {Hugging Face}, journal = {Hugging Face repository}, howpublished = {url{https://huggingface.co/OdiaGenAI}}, }
贡献者
- Shantipriya Parida (Silo AI, Helsinki, Finland)
- Sambit Sekhar (Odia Generative AI, Bhubaneswar, India)
- Aisha Asif (KIIT, University, Bhubaneswar, India)
- Subham Pradhan (Silicon Institute of Technology, Bhubaneswar, India)
- Guneet Singh Kohli (Thapar Institute of Engineering and Technology, India)
- Swateek Jena (RightSense Inc, USA)




