WiNE-iNEFF/1M-OpenOrca_be
收藏Hugging Face2024-12-20 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/WiNE-iNEFF/1M-OpenOrca_be
下载链接
链接失效反馈官方服务:
资源简介:
白俄罗斯语OpenOrca数据集是一个丰富的增强FLAN数据对齐集合,已翻译成白俄罗斯语。该数据集旨在帮助训练白俄罗斯语的大型语言模型,并支持其他NLP任务。数据集有两个版本:约1M的GPT-4完成版本(正在翻译中)和约3.2M的GPT-3.5完成版本(未来可能翻译)。数据字段包括id、system_prompt、question和response。数据来源于OpenOrca_ru和OpenOrca。
The Belarusian OpenOrca dataset is a rich collection of augmented FLAN data that has been translated into Belarusian. This dataset aims to assist in training LLMs in the Belarusian language and support other NLP tasks. The dataset has two versions: 1. Approximately 1 million GPT-4 completions (currently being translated); 2. Approximately 3.2 million GPT-3.5 completions (may be translated in the future). The dataset fields include: id (a unique numbered identifier), system_prompt (the system prompt), question (the question), and response (the response).
提供机构:
WiNE-iNEFF



