marcuscedricridia/UImerge-ShareGPT-deepclean-sharegpt-deepclean-sharegpt
收藏Hugging Face2025-04-04 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/marcuscedricridia/UImerge-ShareGPT-deepclean-sharegpt-deepclean-sharegpt
下载链接
链接失效反馈官方服务:
资源简介:
这是一个经过深度清理的对话数据集,包含人类和GPT之间的对话。数据集原始大小为4312条记录,经过精确去重、长度过滤、近似重复去除等步骤后,最终包含1139条记录。数据集使用ShareGPT格式,每条记录都是单列对话。
This is a deeply cleaned conversation dataset containing dialogues between humans and GPT. The dataset originally contained 4312 records, which were reduced to 1139 after steps such as exact deduplication, length filtering, and near-duplicate removal. The dataset uses the ShareGPT format, with each record being a single-column conversation.
提供机构:
marcuscedricridia



