rovi27/somos-clean-alpaca-es
收藏Hugging Face2023-09-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/rovi27/somos-clean-alpaca-es
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: text
dtype: 'null'
- name: inputs
struct:
- name: 1-instruction
dtype: string
- name: 2-input
dtype: string
- name: 3-output
dtype: string
- name: prediction
list:
- name: label
dtype: string
- name: score
dtype: float64
- name: prediction_agent
dtype: 'null'
- name: annotation
dtype: 'null'
- name: annotation_agent
dtype: 'null'
- name: vectors
struct:
- name: input
sequence: float64
- name: instruction
sequence: float64
- name: output
sequence: float64
- name: multi_label
dtype: bool
- name: explanation
dtype: 'null'
- name: id
dtype: string
- name: metadata
struct:
- name: tr-flag-1-instruction
dtype: bool
- name: tr-flag-2-input
dtype: bool
- name: tr-flag-3-output
dtype: bool
- name: status
dtype: string
- name: event_timestamp
dtype: timestamp[us]
- name: metrics
dtype: 'null'
splits:
- name: train
num_bytes: 985217301
num_examples: 51942
download_size: 651888024
dataset_size: 985217301
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
# Dataset Card for "somos-clean-alpaca-es"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
rovi27
原始信息汇总
数据集概述
数据集信息
- 特征列表:
text: 数据类型为nullinputs: 结构化数据,包含以下字段:1-instruction: 数据类型为string2-input: 数据类型为string3-output: 数据类型为string
prediction: 列表数据,包含以下字段:label: 数据类型为stringscore: 数据类型为float64
prediction_agent: 数据类型为nullannotation: 数据类型为nullannotation_agent: 数据类型为nullvectors: 结构化数据,包含以下字段:input: 序列数据类型为float64instruction: 序列数据类型为float64output: 序列数据类型为float64
multi_label: 数据类型为boolexplanation: 数据类型为nullid: 数据类型为stringmetadata: 结构化数据,包含以下字段:tr-flag-1-instruction: 数据类型为booltr-flag-2-input: 数据类型为booltr-flag-3-output: 数据类型为bool
status: 数据类型为stringevent_timestamp: 数据类型为timestamp[us]metrics: 数据类型为null
数据分割
- 训练集:
- 文件大小: 985217301 字节
- 样本数量: 51942
数据集大小
- 下载大小: 651888024 字节
- 数据集大小: 985217301 字节
配置信息
- 配置名称:
default- 数据文件:
- 分割:
train - 路径:
data/train-*
- 分割:
- 数据文件:



