Mattimax/Fusion_Ita_Datasets_2
收藏Hugging Face2025-09-26 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Mattimax/Fusion_Ita_Datasets_2
下载链接
链接失效反馈官方服务:
资源简介:
Mattimax/Fusion_Ita_Datasets_v2 是一个由多种公共对话、指令和问答数据集融合和标准化而成的意大利语数据集。该数据集经过过滤,移除了空值和重复值,适用于训练用于文本完成、问答和多轮对话的语言模型。数据集来源于 efederici/shp-partial-it、mchl-labs/stambecco_data_it、ReDiX/everyday-conversations-ita 和 ReDiX/QA-ita-200k 四个数据集。
Mattimax/Fusion_Ita_Datasets_v2 is an Italian language dataset created by fusing and normalizing various public datasets including conversations, instructions, and QA. The dataset has been filtered to remove null and duplicate values and is suitable for training language models for text completion, question-answering, and multi-turn dialogues. The data sources include efederici/shp-partial-it, mchl-labs/stambecco_data_it, ReDiX/everyday-conversations-ita, and ReDiX/QA-ita-200k.
提供机构:
Mattimax



