avsolatorio/doc-topics-synthetic_data
收藏Hugging Face2024-04-22 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/avsolatorio/doc-topics-synthetic_data
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: processed
features:
- name: sentence
dtype: string
- name: topics
sequence: string
- name: model_name
dtype: string
splits:
- name: train
num_bytes: 7112254
num_examples: 41416
download_size: 2423429
dataset_size: 7112254
- config_name: raw
features:
- name: content
dtype: string
- name: response_metadata
struct:
- name: token_usage
struct:
- name: completion_time
dtype: float64
- name: completion_tokens
dtype: int64
- name: prompt_time
dtype: float64
- name: prompt_tokens
dtype: int64
- name: queue_time
dtype: 'null'
- name: total_time
dtype: float64
- name: total_tokens
dtype: int64
- name: model_name
dtype: string
- name: system_fingerprint
dtype: string
- name: finish_reason
dtype: string
- name: logprobs
dtype: 'null'
- name: type
dtype: string
- name: name
dtype: 'null'
- name: id
dtype: string
- name: example
dtype: bool
- name: tool_calls
sequence: 'null'
- name: invalid_tool_calls
sequence: 'null'
- name: model_name
dtype: string
splits:
- name: train
num_bytes: 8217048
num_examples: 2015
download_size: 2972054
dataset_size: 8217048
configs:
- config_name: processed
data_files:
- split: train
path: processed/train-*
- config_name: raw
data_files:
- split: train
path: raw/train-*
---
提供机构:
avsolatorio
原始信息汇总
数据集概述
配置信息
配置名称:processed
- 特征:
sentence:字符串类型topics:字符串序列model_name:字符串类型
- 分割:
train:- 字节数:7112254
- 样本数:41416
- 下载大小:2423429 字节
- 数据集大小:7112254 字节
配置名称:raw
- 特征:
content:字符串类型response_metadata:结构体类型,包含以下字段:token_usage:结构体类型,包含以下字段:completion_time:浮点数类型completion_tokens:整数类型prompt_time:浮点数类型prompt_tokens:整数类型queue_time:空类型total_time:浮点数类型total_tokens:整数类型
model_name:字符串类型system_fingerprint:字符串类型finish_reason:字符串类型logprobs:空类型
type:字符串类型name:空类型id:字符串类型example:布尔类型tool_calls:空序列invalid_tool_calls:空序列model_name:字符串类型
- 分割:
train:- 字节数:8217048
- 样本数:2015
- 下载大小:2972054 字节
- 数据集大小:8217048 字节
数据文件
-
配置名称:processed
train:路径为processed/train-*
-
配置名称:raw
train:路径为raw/train-*



