Weni/wenigpt-agent-sft-2.0.0
收藏Hugging Face2024-09-19 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Weni/wenigpt-agent-sft-2.0.0
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- pt
dataset_info:
features:
- name: id
dtype: int64
- name: external_id
dtype: int64
- name: name
dtype: string
- name: occupation
dtype: string
- name: adjective
dtype: string
- name: chatbot_goal
dtype: string
- name: instructions
sequence: string
- name: content
dtype: string
- name: chunks_small
list:
- name: content
dtype: string
- name: score
dtype: float64
- name: chunks_big
list:
- name: content
dtype: string
- name: score
dtype: float64
- name: data_category
dtype: int64
- name: question
dtype: string
- name: answer
dtype: string
splits:
- name: train
num_bytes: 7059194
num_examples: 429
- name: test
num_bytes: 2203690
num_examples: 194
download_size: 2856523
dataset_size: 9262884
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
---
language:
- 葡萄牙语(Portuguese)
数据集信息(dataset_info):
特征字段(features):
- 字段:id,数据类型:64位整型(int64)
- 字段:external_id,数据类型:64位整型(int64)
- 字段:name,数据类型:字符串(string)
- 字段:occupation,数据类型:字符串(string)
- 字段:adjective,数据类型:字符串(string)
- 字段:chatbot_goal,数据类型:字符串(string)
- 字段:instructions,类型:字符串序列(sequence<string>)
- 字段:content,数据类型:字符串(string)
- 字段:chunks_small,为列表类型,包含子字段:
- 子字段:content,数据类型:字符串(string)
- 子字段:score,数据类型:64位浮点型(float64)
- 字段:chunks_big,为列表类型,包含子字段:
- 子字段:content,数据类型:字符串(string)
- 子字段:score,数据类型:64位浮点型(float64)
- 字段:data_category,数据类型:64位整型(int64)
- 字段:question,数据类型:字符串(string)
- 字段:answer,数据类型:字符串(string)
划分集(splits):
- 划分名称:训练集(train),字节数:7059194,样本数量:429
- 划分名称:测试集(test),字节数:2203690,样本数量:194
下载大小(download_size): 2856523
数据集总大小(dataset_size): 9262884
配置项(configs):
- 配置名称:默认(default),数据文件:
- 划分:训练集(train),路径:data/train-*
- 划分:测试集(test),路径:data/test-*
提供机构:
Weni



