justinian336/salvadoran-news-edh
Source: Hugging Face. Updated: 2024-07-22. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/justinian336/salvadoran-news-edh
Resource summary:
The salvadoran-news-edh dataset contains data related to Salvadoran news. Each record has five features: image source (image_src), title (title), content (content), category (category), and link (link). The category field is a class label with ten values: vida (life), deportes/zona-mundialista (sports/World Cup zone), entretenimiento (entertainment), videos, deportes (sports), noticias (news), null, opinion, fotogalerias (photo galleries), and opinion/caricaturas (opinion/cartoons). The dataset has a single training split of 55,345 samples with a total size of 196,407,515 bytes. The download size is 111,585,637 bytes.
Provider:
justinian336
Original information summary
Dataset overview
Dataset name
- Name: salvadoran-news-edh
Dataset features
- Feature list:
  - image_src: string
  - title: string
  - content: string
  - category: class label with the following classes:
    - 0: vida
    - 1: deportes/zona-mundialista
    - 2: entretenimiento
    - 3: videos
    - 4: deportes
    - 5: noticias
    - 6: null
    - 7: opinion
    - 8: fotogalerias
    - 9: opinion/caricaturas
  - link: string
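The class-label ids listed above can be turned into readable names with a small lookup. The sketch below is illustrative only: `CATEGORY_NAMES`, `category_int2str`, `category_str2int`, and `load_train_split` are hypothetical names introduced here, and the loader assumes the dataset is reachable through the Hugging Face `datasets` library under the repository id shown in the title.

```python
# Class-label names in id order, copied from the feature list above.
CATEGORY_NAMES = [
    "vida",                       # 0
    "deportes/zona-mundialista",  # 1
    "entretenimiento",            # 2
    "videos",                     # 3
    "deportes",                   # 4
    "noticias",                   # 5
    "null",                       # 6
    "opinion",                    # 7
    "fotogalerias",               # 8
    "opinion/caricaturas",        # 9
]


def category_int2str(label_id: int) -> str:
    """Map an integer class id (0-9) to its category name."""
    if not 0 <= label_id < len(CATEGORY_NAMES):
        raise ValueError(f"unknown category id: {label_id}")
    return CATEGORY_NAMES[label_id]


def category_str2int(name: str) -> int:
    """Map a category name back to its integer class id."""
    try:
        return CATEGORY_NAMES.index(name)
    except ValueError:
        raise ValueError(f"unknown category name: {name!r}") from None


def load_train_split():
    """Hypothetical helper: needs the `datasets` package and network access.

    Assumes the repository id from the page title is valid on the Hub.
    """
    from datasets import load_dataset  # imported lazily; optional dependency

    return load_dataset("justinian336/salvadoran-news-edh", split="train")
```

For example, `category_int2str(5)` returns `"noticias"`. If the dataset loads as assumed, the `category` column would hold integers in this range, and the feature's own `int2str` method should give the same mapping.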
Dataset splits
- Training split:
  - Size: 196407515 bytes
  - Number of samples: 55345
Dataset size
- Download size: 111585561 bytes
- Total dataset size: 196407515 bytes
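As a rough sanity check on the figures above, the average record size and the ratio of download size to total size can be computed directly. This is a minimal sketch using only the numbers listed in this section; the variable names are illustrative.

```python
# Figures copied from the split and size lists above.
NUM_BYTES = 196_407_515       # total dataset size in bytes
NUM_EXAMPLES = 55_345         # samples in the training split
DOWNLOAD_BYTES = 111_585_561  # download size in bytes, as listed above

# Average number of bytes per sample.
avg_bytes_per_sample = NUM_BYTES / NUM_EXAMPLES

# Fraction of the total size that has to be downloaded.
download_ratio = DOWNLOAD_BYTES / NUM_BYTES

print(f"average bytes per sample: {avg_bytes_per_sample:.1f}")
print(f"download / total size: {download_ratio:.3f}")
```

The average comes out to roughly 3.5 KB per sample, and the download is a little over half of the total size.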



