lavi13/wiki_qa_instructions_ro
收藏Hugging Face2024-05-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/lavi13/wiki_qa_instructions_ro
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- question-answering
language:
- ro
size_categories:
- 10K<n<100K
---
# Dataset Card for Dataset Name
The dataset is created starting from a randomly selected ~10k set of entries from the Wikipedia dataset (https://huggingface.co/datasets/wikimedia/wikipedia),
and using Mixtral (https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) to extract Q&A pairs from each paragraph.
Minimal post-processing and format processing is applied to the Mixtral outputs.
## Dataset Details
### Dataset Description
- **Curated by:** lavi13
- **Language(s) (NLP):** Romanian
### Uses
It is intended to be used as instruction tuning QA data in Romanian.
提供机构:
lavi13
原始信息汇总
数据集卡片
数据集详情
数据集描述
- 任务类别: 问答
- 语言: 罗马尼亚语
- 数据规模: 10K<n<100K
- 创建者: lavi13
用途
该数据集旨在用作罗马尼亚语指令调优的问答数据。



