OpenFact/CLEF24-CheckThat-Task1-en-all
收藏Hugging Face2024-05-03 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/OpenFact/CLEF24-CheckThat-Task1-en-all
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: sentence_id
dtype: int64
- name: text
dtype: string
- name: label_ct
dtype: int64
- name: original_split
dtype: string
- name: speaker
dtype: string
- name: speaker_title
dtype: string
- name: speaker_party
dtype: string
- name: speaker_role
dtype: string
- name: file_id
dtype: string
- name: line_number
dtype: int64
- name: sentiment
dtype: float64
- name: verdict
dtype: int64
- name: annotation
dtype: string
- name: q_20
dtype: bool
- name: q_25
dtype: bool
- name: q_30
dtype: bool
- name: quality
dtype: int64
- name: verdict_fixed
dtype: int64
- name: label_true
dtype: int64
splits:
- name: train
num_bytes: 5739315
num_examples: 22501
- name: dev
num_bytes: 252100
num_examples: 1032
- name: dev_test
num_bytes: 60191
num_examples: 318
- name: test
num_bytes: 66753
num_examples: 341
- name: train_rus
num_bytes: 2853513
num_examples: 10810
- name: dev_rus
num_bytes: 125511
num_examples: 486
- name: dev_test_rus
num_bytes: 42068
num_examples: 216
- name: train_hq
num_bytes: 3307024
num_examples: 12966
- name: train_lq
num_bytes: 2432291
num_examples: 9535
- name: train_hq_rus
num_bytes: 1347243
num_examples: 5042
- name: train_lq_rus
num_bytes: 1504255
num_examples: 5768
- name: train_hq20
num_bytes: 2158475
num_examples: 8292
- name: train_hq25
num_bytes: 2490879
num_examples: 9674
- name: train_hq30
num_bytes: 2830071
num_examples: 11056
download_size: 9026115
dataset_size: 25209689
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: dev
path: data/dev-*
- split: dev_test
path: data/dev_test-*
- split: test
path: data/test-*
- split: train_rus
path: data/train_rus-*
- split: dev_rus
path: data/dev_rus-*
- split: dev_test_rus
path: data/dev_test_rus-*
- split: train_hq
path: data/train_hq-*
- split: train_lq
path: data/train_lq-*
- split: train_hq_rus
path: data/train_hq_rus-*
- split: train_lq_rus
path: data/train_lq_rus-*
- split: train_hq20
path: data/train_hq20-*
- split: train_hq25
path: data/train_hq25-*
- split: train_hq30
path: data/train_hq30-*
---
提供机构:
OpenFact
原始信息汇总
数据集概述
数据集特征
- sentence_id: 整数类型
- text: 字符串类型
- label_ct: 整数类型
- original_split: 字符串类型
- speaker: 字符串类型
- speaker_title: 字符串类型
- speaker_party: 字符串类型
- speaker_role: 字符串类型
- file_id: 字符串类型
- line_number: 整数类型
- sentiment: 浮点数类型
- verdict: 整数类型
- annotation: 字符串类型
- q_20: 布尔类型
- q_25: 布尔类型
- q_30: 布尔类型
- quality: 整数类型
- verdict_fixed: 整数类型
- label_true: 整数类型
数据集分割
- train: 22501个样本,5739315字节
- dev: 1032个样本,252100字节
- dev_test: 318个样本,60191字节
- test: 341个样本,66753字节
- train_rus: 10810个样本,2853513字节
- dev_rus: 486个样本,125511字节
- dev_test_rus: 216个样本,42068字节
- train_hq: 12966个样本,3307024字节
- train_lq: 9535个样本,2432291字节
- train_hq_rus: 5042个样本,1347243字节
- train_lq_rus: 5768个样本,1504255字节
- train_hq20: 8292个样本,2158475字节
- train_hq25: 9674个样本,2490879字节
- train_hq30: 11056个样本,2830071字节
数据集大小
- 下载大小: 9026115字节
- 数据集大小: 25209689字节
配置文件
- config_name: default
- data_files: 包含多个分割的数据文件路径,如
data/train-*等。



