jaeyong2/persona-inst
Hugging Face, updated 2024-10-26, indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/jaeyong2/persona-inst
Resource overview:
This dataset contains multilingual text with the features Level, English, Korean, Japanese, Thai, Vietnamese, and context. It is intended mainly for multilingual text processing tasks such as machine translation and cross-lingual understanding. It was built by generating persona pairs from PersonaHub and then generating questions with the Qwen2-72B-Instruct model. The dataset card includes a usage example that loads the data with the Hugging Face datasets library. The dataset is licensed under CC-BY-NC-SA-4.0, and the research was supported by the TPU Research Cloud program.
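The loading step mentioned in the overview can be sketched roughly as follows. This is a minimal sketch, not the dataset card's own example: the split name `train`, the exact casing of the column names, and the helper names `load_persona_inst` and `translations` are assumptions made for illustration. Only the repository name and the feature list come from this page.

```python
from typing import Dict

# Repository name as shown on this page.
DATASET_ID = "jaeyong2/persona-inst"

# Language columns as listed in the overview; exact casing is an assumption.
LANGUAGE_COLUMNS = ("English", "Korean", "Japanese", "Thai", "Vietnamese")


def load_persona_inst(split: str = "train"):
    """Download the dataset from the Hugging Face Hub (needs network access).

    The default split name "train" is an assumption, not confirmed by this page.
    """
    # Imported lazily so the helper below works without the library installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


def translations(row: Dict[str, str]) -> Dict[str, str]:
    """Return only the per-language text columns that are present in a row."""
    return {lang: row[lang] for lang in LANGUAGE_COLUMNS if lang in row}
```

With network access and the `datasets` package installed, `ds = load_persona_inst()` followed by `translations(ds[0])` would give the parallel texts of the first record, leaving out Level and context.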
Provided by:
jaeyong2



