zhan1993/flan-10k-flat-10cluster-embedding
收藏Hugging Face2024-06-12 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/zhan1993/flan-10k-flat-10cluster-embedding
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: source
dtype: string
- name: target
dtype: string
- name: task_name
dtype: string
- name: task_source
dtype: string
- name: template_type
dtype: string
- name: template_idx
dtype: int64
- name: split
dtype: string
- name: cluster_id
dtype: string
splits:
- name: train
num_bytes: 5576922634
num_examples: 2391621
download_size: 3023562796
dataset_size: 5576922634
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
# Dataset Card for "flan-10k-flat-10cluster-embedding"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
This dataset includes multiple features such as source text, target text, task name, task source, template type, template index, dataset split, and cluster ID. It is primarily used for training, containing 2391621 samples with a total size of 5576922634 bytes and a download size of 3023562796 bytes.
提供机构:
zhan1993
原始信息汇总
数据集概述
数据集信息
-
特征:
source: 字符串类型target: 字符串类型task_name: 字符串类型task_source: 字符串类型template_type: 字符串类型template_idx: 整数类型split: 字符串类型cluster_id: 字符串类型
-
分割:
train: 包含2,391,621个样本,占用5,576,922,634字节
-
下载大小:3,023,562,796字节
-
数据集大小:5,576,922,634字节
配置
- 配置名称:
default- 数据文件:
train: 路径为data/train-*
- 数据文件:



