preference-agents-working/enron-personalization-sample-with-metrics
收藏Hugging Face2024-06-01 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/preference-agents-working/enron-personalization-sample-with-metrics
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: string
- name: message_id
dtype: string
- name: from
sequence: string
- name: to
sequence: string
- name: date
dtype: string
- name: subject
dtype: string
- name: content
dtype: string
- name: email_context
dtype: string
- name: token_count_content
dtype: int32
- name: token_count_context
dtype: int32
- name: content_extracted
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: baseline_generated_emails
struct:
- name: databricks-dbrx-instruct
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: databricks-llama-2-70b-chat
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: databricks-mixtral-8x7b-instruct
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: automatic_eval
struct:
- name: databricks-dbrx-instruct
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: databricks-llama-2-70b-chat
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
- name: databricks-mixtral-8x7b-instruct
struct:
- name: databricks-dbrx-instruct
dtype: string
- name: databricks-llama-2-70b-chat
dtype: string
- name: databricks-mixtral-8x7b-instruct
dtype: string
splits:
- name: train
num_bytes: 1609987
num_examples: 129
download_size: 889785
dataset_size: 1609987
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
This dataset is primarily used for email-related analysis and processing, including basic email information (such as sender, recipient, date, subject, and content) as well as some structured data (such as extracted content, generated emails, and automatic evaluation results). The dataset is divided into a training set, suitable for training models for email-related tasks.
提供机构:
preference-agents-working
原始信息汇总
数据集概述
数据集特征
- id: 字符串类型
- message_id: 字符串类型
- from: 字符串序列类型
- to: 字符串序列类型
- date: 字符串类型
- subject: 字符串类型
- content: 字符串类型
- email_context: 字符串类型
- token_count_content: 整数类型(32位)
- token_count_context: 整数类型(32位)
- content_extracted: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- baseline_generated_emails: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-llama-2-70b-chat: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-mixtral-8x7b-instruct: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-dbrx-instruct: 结构体类型,包含以下字段:
- automatic_eval: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-llama-2-70b-chat: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-mixtral-8x7b-instruct: 结构体类型,包含以下字段:
- databricks-dbrx-instruct: 字符串类型
- databricks-llama-2-70b-chat: 字符串类型
- databricks-mixtral-8x7b-instruct: 字符串类型
- databricks-dbrx-instruct: 结构体类型,包含以下字段:
数据集分割
- train: 数据量1609987字节,包含129个样本
数据集大小
- 下载大小: 889785字节
- 数据集大小: 1609987字节



