vinhtran2611/lmsys_chatbot_arena_conversations_filtered

Name: vinhtran2611/lmsys_chatbot_arena_conversations_filtered
Creator: vinhtran2611
Published: 2024-06-14 07:04:17
License: 暂无描述

Hugging Face2024-06-14 更新2024-06-29 收录

下载链接：

https://hf-mirror.com/datasets/vinhtran2611/lmsys_chatbot_arena_conversations_filtered

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: question_id dtype: string - name: model_a dtype: string - name: model_b dtype: string - name: winner dtype: string - name: judge dtype: string - name: conversation_a list: - name: content dtype: string - name: role dtype: string - name: conversation_b list: - name: content dtype: string - name: role dtype: string - name: turn dtype: int64 - name: anony dtype: bool - name: language dtype: string - name: tstamp dtype: float64 - name: openai_moderation struct: - name: categories struct: - name: harassment dtype: bool - name: harassment/threatening dtype: bool - name: hate dtype: bool - name: hate/threatening dtype: bool - name: self-harm dtype: bool - name: self-harm/instructions dtype: bool - name: self-harm/intent dtype: bool - name: sexual dtype: bool - name: sexual/minors dtype: bool - name: violence dtype: bool - name: violence/graphic dtype: bool - name: category_scores struct: - name: harassment dtype: float64 - name: harassment/threatening dtype: float64 - name: hate dtype: float64 - name: hate/threatening dtype: float64 - name: self-harm dtype: float64 - name: self-harm/instructions dtype: float64 - name: self-harm/intent dtype: float64 - name: sexual dtype: float64 - name: sexual/minors dtype: float64 - name: violence dtype: float64 - name: violence/graphic dtype: float64 - name: flagged dtype: bool - name: toxic_chat_tag struct: - name: roberta-large struct: - name: flagged dtype: bool - name: probability dtype: float64 - name: t5-large struct: - name: flagged dtype: bool - name: score dtype: float64 splits: - name: train num_bytes: 130347.62021212121 num_examples: 53 download_size: 116876 dataset_size: 130347.62021212121 configs: - config_name: default data_files: - split: train path: data/train-* --- # Dataset Card for "lmsys_chatbot_arena_conversations_filtered" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

vinhtran2611

原始信息汇总

数据集概述

数据集名称

lmsys_chatbot_arena_conversations_filtered

数据集特征

question_id: 字符串类型
model_a: 字符串类型
model_b: 字符串类型
winner: 字符串类型
judge: 字符串类型
conversation_a: 列表类型
- content: 字符串类型
- role: 字符串类型
conversation_b: 列表类型
- content: 字符串类型
- role: 字符串类型
turn: 整数类型
anony: 布尔类型
language: 字符串类型
tstamp: 浮点数类型
openai_moderation: 结构体类型
- categories: 结构体类型
  - harassment: 布尔类型
  - harassment/threatening: 布尔类型
  - hate: 布尔类型
  - hate/threatening: 布尔类型
  - self-harm: 布尔类型
  - self-harm/instructions: 布尔类型
  - self-harm/intent: 布尔类型
  - sexual: 布尔类型
  - sexual/minors: 布尔类型
  - violence: 布尔类型
  - violence/graphic: 布尔类型
- category_scores: 结构体类型
  - harassment: 浮点数类型
  - harassment/threatening: 浮点数类型
  - hate: 浮点数类型
  - hate/threatening: 浮点数类型
  - self-harm: 浮点数类型
  - self-harm/instructions: 浮点数类型
  - self-harm/intent: 浮点数类型
  - sexual: 浮点数类型
  - sexual/minors: 浮点数类型
  - violence: 浮点数类型
  - violence/graphic: 浮点数类型
- flagged: 布尔类型
toxic_chat_tag: 结构体类型
- roberta-large: 结构体类型
  - flagged: 布尔类型
  - probability: 浮点数类型
- t5-large: 结构体类型
  - flagged: 布尔类型
  - score: 浮点数类型

数据集分割

train:
- num_bytes: 130347.62021212121
- num_examples: 53

数据集大小

download_size: 116876
dataset_size: 130347.62021212121

配置

config_name: default
- data_files:
  - split: train
  - path: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集