Aratako/Synthetic-JP-10-Turns-Roleplay-Dialogues-Nemotron-4-1k
收藏Hugging Face2024-07-03 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/Aratako/Synthetic-JP-10-Turns-Roleplay-Dialogues-Nemotron-4-1k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为Synthetic-JP-10-Turns-Roleplay-Dialogues-Nemotron-4-1k,包含约1000条日语角色扮演对话,每条对话包含10个回合。数据集是使用NVIDIA的Nemotron-4-340B-Instruct模型生成的,并基于Magpie方法创建的合成instruction数据集。数据生成过程中使用了DeepInfra平台,且未进行事后过滤处理,因此可能包含质量较低的记录。此外,部分数据在长对话中可能倾向于提前结束角色扮演。
This dataset is generated using the [nvidia/Nemotron-4-340B-Instruct](https://huggingface.co/nvidia/Nemotron-4-340B-Instruct) model and contains approximately 1000 Japanese roleplay dialogues, each with 10 turns. It is based on the [Aratako/Synthetic-JP-Roleplay-Instruction-Nemotron-4-1k](https://huggingface.co/datasets/Aratako/Synthetic-JP-Roleplay-Instruction-Nemotron-4-1k) synthetic instruction dataset and uses the Magpie method to generate continuous dialogues. The dataset has not undergone post-filtering processing, so it may contain records of low quality. Additionally, dialogues in the dataset tend to end prematurely during long turns, suggesting that limiting the number of turns used might be advisable.
提供机构:
Aratako



