five

Aratako/magpie-ultra-v0.1-formatted

收藏
Hugging Face2024-11-25 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Aratako/magpie-ultra-v0.1-formatted
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个将argilla/magpie-ultra-v0.1数据集转换为OpenAI messages格式的数据集。数据集包含多个特征字段,如模型名称响应基础、指令、响应、响应基础、意图、知识、难度、模型名称难度、解释、质量、模型名称质量、主要标签、其他标签、模型名称分类、嵌入、模型名称嵌入、分数、分数基础、distilabel元数据、最近邻索引、最近邻分数、消息、守卫、模型名称守卫、安全、危险类别、分数差异和对话等。数据集分为训练集,包含50,000个样本,总大小为956,686,523字节。由于原始数据中已存在相同格式的messages列,因此使用此数据集的必要性不大。

This is a dataset that converts the argilla/magpie-ultra-v0.1 dataset into the OpenAI messages format. The dataset includes multiple feature fields such as model_name_response_base, instruction, response, response_base, intent, knowledge, difficulty, model_name_difficulty, explanation, quality, model_name_quality, primary_tag, other_tags, model_name_classification, embedding, model_name_embeddings, score, score_base, distilabel_metadata, nn_indices, nn_scores, messages, guard, model_name_guard, safe, hazard_category, score_difference, and conversations. The dataset is divided into a training set containing 50,000 samples with a total size of 956,686,523 bytes. Since the original data already contains a messages column in the same format, there is little necessity to use this dataset.
提供机构:
Aratako
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作