Locutusque/InstructMixCleaned
收藏Hugging Face2023-11-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Locutusque/InstructMixCleaned
下载链接
链接失效反馈官方服务:
资源简介:
---
name: InstructiveMixCleaned
tagline: A Combined Dataset of Diverse Instructional Content
description: >
InstructiveMix is a comprehensive dataset that brings together various
instructional content from different domains. It combines instructions for
tasks, code, poems, math, essays, medical texts, and more. With a diverse
range of instructional data, this dataset is suitable for a wide range of
natural language processing (NLP) tasks and research.
authors:
- name: Locutusque
email: locutusque.airshipcraft@gmail.com
task_categories:
- text-generation
- conversational
- question-answering
language:
- en
pretty_name: Instruct Mix Cleaned
license: apache-2.0
---
Cleaned the dataset https://huggingface.co/datasets/Locutusque/InstructMix to remove RLHF responses
提供机构:
Locutusque
原始信息汇总
InstructiveMixCleaned
简介
InstructiveMixCleaned 是一个综合性的数据集,汇集了来自不同领域的多样化教学内容。该数据集结合了任务说明、代码、诗歌、数学、论文、医学文本等多种类型的教学数据,适用于广泛的NLP任务和研究。
作者
- 名称:Locutusque
- 邮箱:locutusque.airshipcraft@gmail.com
任务类别
- 文本生成
- 对话系统
- 问答系统
语言
- 英语
数据集名称
- Instruct Mix Cleaned
许可证
- Apache 2.0
数据集处理
- 清理了原始数据集 https://huggingface.co/datasets/Locutusque/InstructMix,移除了RLHF响应。



