bryandts/instruction-dataset-indo-java-sunda-bali-gayo-batak-alas-minang-betawi
Source: Hugging Face. Updated: 2024-12-17. Indexed: 2024-12-21.
Download link:
https://hf-mirror.com/datasets/bryandts/instruction-dataset-indo-java-sunda-bali-gayo-batak-alas-minang-betawi
Description:
The dataset covers three languages: Indonesian (id), Sundanese (su), and Javanese (jv). It has three features, instruction, input, and output, all of string type. There is a single training split containing 282,047 samples, with a total size of 157,776,049 bytes. The task categories are text generation and question answering, and the size category is 100K<n<1M.
Provided by:
bryandts
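
As a rough illustration of how the three string features described above (instruction, input, output) might be combined for text generation, here is a minimal sketch. The prompt template, the `format_example` helper, and the sample record are all assumptions made up for this example; they are not taken from the dataset or its author. In practice the records would presumably be loaded with the Hugging Face `datasets` library (for instance `load_dataset(..., split="train")` with the dataset name above), which is left out here because it needs network access.

```python
from typing import Dict


def format_example(record: Dict[str, str]) -> str:
    """Render one instruction/input/output record as a single prompt string.

    The section headers used here are an assumed template, not one
    prescribed by the dataset.
    """
    instruction = record["instruction"].strip()
    context = record.get("input", "").strip()
    output = record["output"].strip()

    parts = [f"### Instruction:\n{instruction}"]
    # The input field may be empty; skip the section in that case.
    if context:
        parts.append(f"### Input:\n{context}")
    parts.append(f"### Response:\n{output}")
    return "\n\n".join(parts)


# Hypothetical record, invented for illustration; not an actual dataset row.
sample = {
    "instruction": "Translate the sentence into Javanese.",
    "input": "Selamat pagi.",
    "output": "Sugeng enjing.",
}

print(format_example(sample))
```

Skipping the input section when it is empty keeps prompts compact for records that carry only an instruction and an output.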



