d2uxd2ux/kyungsang_ko_class_new
收藏Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/d2uxd2ux/kyungsang_ko_class_new
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个韩语问答数据集,由标准问题与庆尚道方言回答组成。输入格式为问题,输出格式为回答,特点是使用标准韩语提问,庆尚道方言回答,方言强度为强。数据集结构为每个样本包含一个问题和一个方言回答。数据集分为训练集(212个样本)和验证集(24个样本)。适用于韩语LLM指令调优、标准语到庆尚道方言的风格转换、方言生成及风格控制实验等。需要注意的是,数据集是通过生成方式构建的,可能与实际地区使用者的自然表达存在差异,且方言表达可能因地区、年龄和语境而异。
This dataset is a Korean question-answering dataset composed of standard language questions and Gyeongsang dialect answers. The input format is a question, and the output format is an answer, characterized by standard Korean questions and strong Gyeongsang dialect answers. The dataset structure consists of each sample containing a question and a dialect answer. The dataset is divided into a training set (212 samples) and a validation set (24 samples). It is suitable for Korean LLM instruction tuning, standard language to Gyeongsang dialect style conversion, dialect generation, and style control experiments. Note that the dataset is constructed in a generative manner, which may differ from the natural expressions of actual regional speakers, and dialect expressions may vary by region, age, and context.
提供机构:
d2uxd2ux



