minyichen/tw-instruct-500k-cleaned
收藏Hugging Face2025-01-24 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/minyichen/tw-instruct-500k-cleaned
下载链接
链接失效反馈官方服务:
资源简介:
这是一个在台湾常用的任务导向对话数据集,包含499148条对话数据。数据集对原始的lianghsun/tw-instruct数据集进行了修正,包括简繁转换的错误和删除了模型回答无穷回覆的数据。适用于文本生成任务,支持中文和英文两种语言。
This is a dataset of commonly used task-oriented dialogues in Taiwan, containing 499148 conversation entries. The dataset is a corrected version of the original lianghsun/tw-instruct dataset, including the correction of Simplified to Traditional Chinese conversion errors and the removal of data with infinite loop responses from the model. It is suitable for text generation tasks and supports both Chinese and English languages.
提供机构:
minyichen



