cognitivecomputations/WizardLM_alpaca_evol_instruct_70k_unfiltered
收藏Hugging Face2023-04-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/cognitivecomputations/WizardLM_alpaca_evol_instruct_70k_unfiltered
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is the WizardLM dataset victor123/evol_instruct_70k, removing instances of blatant alignment.
54974 instructions remain.
inspired by https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered
All credit to anon8231489123 for the cleanup script that I adapted to wizardlm_clean.py
---
license: apache-2.0
language:
- en
pretty_name: wizardlm-unfiltered
---
提供机构:
cognitivecomputations
原始信息汇总
数据集概述
数据集名称
- 名称: WizardLM 数据集
- 版本: victor123/evol_instruct_70k
数据集内容
- 原始数据量: 70k
- 处理后数据量: 54974条指令
- 处理说明: 移除了明显的对齐实例
数据集灵感来源
- 灵感来源: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered
数据集清理脚本
- 清理脚本: wizardlm_clean.py
- 脚本来源: 改编自anon8231489123的清理脚本
数据集属性
- 许可证: Apache-2.0
- 语言: 英语
- 别名: wizardlm-unfiltered



