Multimodal-Fatima/Hatefulmemes_train
Source: Hugging Face. Updated 2023-05-07. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Multimodal-Fatima/Hatefulmemes_train
Resource description:
---
dataset_info:
  features:
  - name: image
    dtype: image
  - name: text
    dtype: string
  - name: label
    dtype:
      class_label:
        names:
          '0': not-hateful
          '1': hateful
  - name: id
    dtype: int64
  - name: clip_tags_ViT_L_14
    sequence: string
  - name: blip_caption
    dtype: string
  - name: LLM_Description_gpt3_downstream_tasks_ViT_L_14
    sequence: string
  - name: clip_tags_LAION_ViT_H_14_2B
    sequence: string
  - name: blip_caption_beam_5
    dtype: string
  - name: LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14
    sequence: string
  - name: LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B
    sequence: string
  - name: DETA_detections_deta_swin_large_o365_coco_classes
    list:
    - name: attribute
      dtype: string
    - name: box
      sequence: float32
    - name: label
      dtype: string
    - name: location
      dtype: string
    - name: ratio
      dtype: float32
    - name: size
      dtype: string
    - name: tag
      dtype: string
  - name: Attributes_ViT_L_14_descriptors_text_davinci_003_full
    sequence: string
  - name: Attributes_LAION_ViT_H_14_2B_descriptors_text_davinci_003_full
    sequence: string
  splits:
  - name: train
    num_bytes: 3066249406.0
    num_examples: 8500
  download_size: 3059695187
  dataset_size: 3066249406.0
---
# Dataset Card for "Hatefulmemes_train"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
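A minimal sketch of how this split might be loaded with the Hugging Face `datasets` library. It assumes the repository is still publicly available under the id shown in the page title and that `datasets` is installed. The helper name `load_train` is hypothetical and is not part of the dataset card. Streaming is chosen here only to avoid downloading the roughly 3 GB archive up front.

```python
REPO_ID = "Multimodal-Fatima/Hatefulmemes_train"  # repository id from this page


def load_train(streaming: bool = True):
    """Return the `train` split (hypothetical helper, not from the card).

    The import is done lazily so this module can be imported without the
    `datasets` package; calling the function needs it plus network access.
    """
    from datasets import load_dataset

    # `split="train"` matches the only split listed in the card metadata.
    return load_dataset(REPO_ID, split="train", streaming=streaming)
```

Under these assumptions, `next(iter(load_train()))` would yield one record as a dict keyed by the feature names listed in the metadata (`image`, `text`, `label`, `id`, and so on).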
Provider:
Multimodal-Fatima
Summary of original information
Dataset overview
Dataset name
- Hatefulmemes_train
Dataset features
- image: image data
- text: text data, of type string
- label: class label with two classes: 0: not-hateful, 1: hateful
- id: identifier, of type int64
- clip_tags_ViT_L_14: sequence of strings
- blip_caption: text data, of type string
- LLM_Description_gpt3_downstream_tasks_ViT_L_14: sequence of strings
- clip_tags_LAION_ViT_H_14_2B: sequence of strings
- blip_caption_beam_5: text data, of type string
- LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14: sequence of strings
- LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B: sequence of strings
- DETA_detections_deta_swin_large_o365_coco_classes: list whose items have the following sub-features:
  - attribute: text data, of type string
  - box: sequence of float32
  - label: text data, of type string
  - location: text data, of type string
  - ratio: numeric value, of type float32
  - size: text data, of type string
  - tag: text data, of type string
- Attributes_ViT_L_14_descriptors_text_davinci_003_full: sequence of strings
- Attributes_LAION_ViT_H_14_2B_descriptors_text_davinci_003_full: sequence of strings
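The feature list above can be illustrated with a short sketch that decodes the class label and filters the nested detection list. The record below is hand-written for illustration and its values are not taken from the dataset. The helper names `label_name` and `large_detections` are hypothetical, and the reading of `ratio` as the fraction of the image covered by a box is an assumption, since the card does not document it. The coordinate convention of `box` is likewise undocumented.

```python
LABEL_NAMES = ["not-hateful", "hateful"]  # index = class id, as listed in the card
DETECTIONS = "DETA_detections_deta_swin_large_o365_coco_classes"


def label_name(label_id: int) -> str:
    """Map a class id (0 or 1) to its name."""
    return LABEL_NAMES[label_id]


def large_detections(record: dict, min_ratio: float = 0.1) -> list:
    """Return the `label` of each detection with `ratio` >= `min_ratio`.

    Assumption: `ratio` is the share of the image area covered by the box.
    """
    return [d["label"] for d in record[DETECTIONS] if d["ratio"] >= min_ratio]


# Hand-written placeholder record; only the field names come from the card.
example = {
    "id": 0,
    "text": "placeholder meme text",
    "label": 1,
    DETECTIONS: [
        {"attribute": "", "box": [0.0, 0.0, 10.0, 20.0], "label": "person",
         "location": "", "ratio": 0.5, "size": "", "tag": ""},
        {"attribute": "", "box": [1.0, 1.0, 2.0, 2.0], "label": "hat",
         "location": "", "ratio": 0.02, "size": "", "tag": ""},
    ],
}

print(label_name(example["label"]))
print(large_detections(example))
```

Because the metadata declares the detections as a `list` of named sub-features, each record is expected to carry a list of dicts, which is the shape this sketch assumes.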
Dataset splits
- train: training set with 8,500 examples, 3066249406.0 bytes of data
Dataset size
- Download size: 3059695187 bytes
- Dataset size: 3066249406.0 bytes
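To put the raw byte counts above in more familiar units, here is a small arithmetic sketch. The constants are copied from the figures above. The helper name `to_gib` is hypothetical, and the per-example figure is only an average over the split, not a size measured for any individual record.

```python
NUM_EXAMPLES = 8500          # examples in the train split
DATASET_BYTES = 3066249406   # dataset size in bytes
DOWNLOAD_BYTES = 3059695187  # download size in bytes


def to_gib(num_bytes: float) -> float:
    """Convert a byte count to gibibytes (1 GiB = 2**30 bytes)."""
    return num_bytes / 2**30


avg_bytes = DATASET_BYTES / NUM_EXAMPLES  # mean bytes per example

print(f"dataset size:  {to_gib(DATASET_BYTES):.2f} GiB")
print(f"download size: {to_gib(DOWNLOAD_BYTES):.2f} GiB")
print(f"average per example: {avg_bytes / 1024:.1f} KiB")
```

The two totals differ by well under one percent, which suggests the stored files are barely compressed beyond the image encoding itself.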



