celsowm/adoro_cinema_filmes
收藏Hugging Face2024-05-30 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/celsowm/adoro_cinema_filmes
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: titulo
dtype: string
- name: sinopse
dtype: string
- name: generos
sequence: string
- name: link
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 23369140
num_examples: 42918
download_size: 13807632
dataset_size: 23369140
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: apache-2.0
task_categories:
- text-classification
task_ids:
- multi-label-classification
language:
- pt
---
# Dataset Card for "adoro_cinema_filmes"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
dataset_info:
features:
- name: 标题(titulo)
dtype: 字符串(string)
- name: 剧情梗概(sinopse)
dtype: 字符串(string)
- name: 影视类型(generos)
dtype: 字符串序列(sequence: string)
- name: 链接(link)
dtype: 字符串(string)
- name: __index_level_0__
dtype: 64位整数(int64)
splits:
- name: 训练集(train)
num_bytes: 23369140
num_examples: 42918
download_size: 13807632
dataset_size: 23369140
configs:
- config_name: 默认配置(default)
data_files:
- split: 训练集(train)
path: data/train-*
license: Apache 2.0 协议
task_categories:
- 文本分类(text-classification)
task_ids:
- 多标签分类(multi-label-classification)
language:
- 葡萄牙语(pt)
---
# 「adoro_cinema_filmes」数据集卡片
[需补充更多信息](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
celsowm
原始信息汇总
数据集概述
数据集信息
特征
- titulo: 类型为字符串
- sinopse: 类型为字符串
- generos: 类型为字符串序列
- link: 类型为字符串
- index_level_0: 类型为int64
分割
- train: 包含42918个样本,占用23369140字节
大小
- 下载大小: 13807632字节
- 数据集大小: 23369140字节
配置
- default: 包含训练数据文件,路径为
data/train-*
许可
- apache-2.0
任务类别
- text-classification
任务ID
- multi-label-classification
语言
- pt (葡萄牙语)



