rombodawg/Everything_Instruct_Multilingual
收藏Hugging Face2024-10-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/rombodawg/Everything_Instruct_Multilingual
下载链接
链接失效反馈官方服务:
资源简介:
Everything Instruct(多语言版)是一个大规模的Alpaca指令格式数据集,包含广泛的主题,旨在将开源AI中的大型语言模型(LLM)提升到新的水平。该数据集完全未经审查(除非经过对齐,否则任何模型都不会拒绝基于此数据集的请求)。该版本的数据集支持多种语言,包括英语、俄语、中文、韩语、乌尔都语、拉丁语、阿拉伯语、德语、西班牙语、法语、印地语、意大利语、日语、荷兰语和葡萄牙语。数据集涵盖了科学、社交媒体、通用知识、多语言、烹饪、写作、医学、历史、法律、角色扮演、新闻、编码、数学、函数调用和通用指令等多个领域。
Everything Instruct (Multilingual Edition) is a massive alpaca instruct formatted dataset consisting of a wide variety of topics meant to bring LLMs to the next level in open source AI. This dataset is fully uncensored (No model will refuse any request trained on this dataset unless otherwise aligned). This version of the dataset supports multiple languages including English, Russian, Chinese, Korean, Urdu, Latin, Arabic, German, Spanish, French, Hindi, Italian, Japanese, Dutch, and Portuguese. The dataset covers various domains such as Science, Social media, General Knowledge, Multi-lingual, Cooking, Writing, Medicine, History, Law, Role-Play, News, Coding, Math, Function calling, and General Instruct.
提供机构:
rombodawg



