Flaglab/latam-xix
收藏Hugging Face2024-10-21 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/Flaglab/latam-xix
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators:
- no-annotation
language:
- es
language_creators:
- crowdsourced
- machine-generated
license:
- mit
multilinguality:
- monolingual
pretty_name: Latin-American XIX Century Spanish Corpus
size_categories:
- 1K<n<10K
source_datasets:
- original
tags:
- latin-american
- newspapers
- 19th-century
- 1800-1900
- research
- spanish
task_categories:
- fill-mask
- text-retrieval
- text-classification
task_ids:
- slot-filling
- masked-language-modeling
- document-retrieval
- dialogue-generation
- multi-label-classification
- entity-linking-classification
- sentiment-classification
- semantic-similarity-scoring
- semantic-similarity-classification
- sentiment-scoring
- sentiment-analysis
- topic-classification
- multi-input-text-classification
- multi-class-classification
- hate-speech-detection
configs:
- config_name: corrected
data_files: "corrected-latam-xix.parquet"
default: true
- config_name: cleaned
data_files: "cleaned-latam-xix.parquet"
- config_name: original
data_files: "original-latam-xix.parquet"
- config_name: chunked
data_files: "chunked-latam-xix.parquet"
---
提供机构:
Flaglab
原始信息汇总
数据集概述
基本信息
- 名称: Latin-American XIX Century Spanish Corpus
- 语言: 西班牙语(es)
- 语言创建方式: 众包(crowdsourced)和机器生成(machine-generated)
- 许可证: MIT
- 多语言性: 单语种
- 大小: 1K<n<10K
- 数据来源: 原始数据
标签
- 拉丁美洲
- 报纸
- 19世纪
- 1800-1900
- 研究
- 西班牙语
任务类别
- 填空
- 文本检索
- 文本分类
具体任务
- 槽填充
- 掩码语言建模
- 文档检索
- 对话生成
- 多标签分类
- 实体链接分类
- 情感分类
- 语义相似度评分
- 语义相似度分类
- 情感评分
- 情感分析
- 主题分类
- 多输入文本分类
- 多类别分类
- 仇恨言论检测
配置
- corrected:
- 数据文件: corrected-latam-xix.parquet
- 默认: 是
- cleaned:
- 数据文件: cleaned-latam-xix.parquet
- original:
- 数据文件: original-latam-xix.parquet



