xmanii/mauxitalk-persian
收藏Hugging Face2024-11-29 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/xmanii/mauxitalk-persian
下载链接
链接失效反馈官方服务:
资源简介:
MauxiTalk是一个高质量的波斯语对话数据集,包含2000多个对话,专门用于训练和微调大型语言模型(LLMs)。数据集中的对话是从SmolTalk数据集精心翻译而来,使用了先进的语言模型。每个对话都遵循用户/助手的角色格式,涵盖了日常生活中的各种话题。数据集的结构为JSONL格式,总大小为2.85 MB,下载大小为1.17 MB。该数据集适用于波斯语语言模型的训练、现有LLMs的微调、开发对话式AI系统、波斯语NLP研究以及创建波斯语聊天机器人。
MauxiTalk is a high-quality dataset of over 2,000 Persian conversations, specifically curated for training and fine-tuning Large Language Models (LLMs). The conversations are carefully translated from the SmolTalk dataset using state-of-the-art language models. Each conversation follows a user/assistant role format and covers a variety of topics in daily life. The dataset is structured in JSONL format, with a total size of 2.85 MB and a download size of 1.17 MB. It is suitable for training Persian language models, fine-tuning existing LLMs, developing conversational AI systems, Persian NLP research, and creating Persian chatbots.
提供机构:
xmanii



