NorHsangPha/oasst1_shan_translation
收藏Hugging Face2024-07-11 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/NorHsangPha/oasst1_shan_translation
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是OpenAssistant/oasst1数据集的Shan语言翻译版本,使用了facebook/nllb-200-3.3B模型进行翻译。数据集包含多个字段,如message_id、parent_id、user_id、created_date、text、role、lang、review_count、review_result、deleted、rank、synthetic、model_name、detoxify、message_tree_id、tree_state、emojis和labels。数据集分为训练集和验证集,训练集包含80917个样本,验证集包含3291个样本。数据质量尚未经过人工检查,可能存在低质量问题。
This dataset is a translation version of OpenAssistant/oasst1 to Shan language, translated by facebook/nllb-200-3.3B. It includes features such as message ID, user ID, creation date, text content, role, language, review count, review result, deletion status, rank, synthetic flag, model name, detoxification analysis, message tree ID, tree state, emojis, and labels. The dataset is split into a training set with 80917 samples and a validation set with 3291 samples. Note that the data quality has not been checked by a human yet, so it might be of low quality.
提供机构:
NorHsangPha



