jtatman/tinymistral-hypnosis-instruct-preprocessed
收藏Hugging Face2024-01-16 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/jtatman/tinymistral-hypnosis-instruct-preprocessed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集用于加速处理,基于Locutusque/TinyMistral-248M-Instruct模型生成嵌入。数据集包含文本、输入ID和注意力掩码等特征,分为训练集和评估集,分别包含大量和少量示例。数据集适用于问答、文本生成和对话等任务,涉及健康、医疗、治疗和催眠等主题,语言为英语。
This dataset is designed for accelerated processing, with its embeddings generated using the Locutusque/TinyMistral-248M-Instruct model. It includes features such as text, input IDs, and attention masks, and is split into training and evaluation subsets, which contain a large number and a small number of samples respectively. This dataset is applicable to tasks including question answering, text generation, and conversational scenarios, covering topics such as health, medical care, treatment, and hypnosis, and all content is in English.
提供机构:
jtatman
原始信息汇总
数据集概述
数据集信息
特征
- text: 类型为字符串。
- input_ids: 类型为整数序列,数据类型为int32。
- attention_mask: 类型为整数序列,数据类型为int8。
分割
- train:
- 字节数: 1031463407.6346276
- 样本数: 2832454
- eval:
- 字节数: 9103.973159269555
- 样本数: 25
大小
- 下载大小: 307894933
- 数据集大小: 1031472511.6077869
配置
- default:
- 训练数据文件路径: data/train-*
- 评估数据文件路径: data/eval-*
许可
- 许可证: apache-2.0
任务类别
- 问答
- 文本生成
- 对话
语言
- 英语
标签
- 健康
- 医疗
- 治疗
- 催眠
显示名称
- hypnosis instruct
大小类别
- 1M<n<10M



