LocalDoc/various_topics_articles_azerbaijan
收藏Hugging Face2024-03-19 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/LocalDoc/various_topics_articles_azerbaijan
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- az
license: cc-by-nc-4.0
size_categories:
- 100K<n<1M
task_categories:
- text-generation
- fill-mask
pretty_name: Articles Dataset in Azerbaijani
dataset_info:
features:
- name: article
dtype: string
splits:
- name: train
num_bytes: 1500707584
num_examples: 236443
download_size: 778425672
dataset_size: 1500707584
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
<h2>Articles Dataset in Azerbaijani</h2>
Description
This dataset contains various topics articles in Azerbaijani language. It was created in 2024 and contains 236k articles (approximately 1 million sentences).
License
The dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International license. This license allows you to freely share and redistribute the dataset with attribution to the source but prohibits commercial use.
Contact information
If you have any questions or suggestions, please contact us at [v.resad.89@gmail.com].
提供机构:
LocalDoc
原始信息汇总
数据集概述
基本信息
- 语言: 阿塞拜疆语 (az)
- 许可证: 知识共享署名-非商业性使用 4.0 国际许可 (cc-by-nc-4.0)
- 大小范围: 10万<n<100万
- 任务类别:
- 文本生成
- 填充掩码
- 美观名称: 阿塞拜疆语文章数据集
数据集详情
- 特征:
- 名称: 文章
- 数据类型: 字符串
- 分割:
- 类型: 训练
- 字节数: 1500707584
- 示例数量: 236443
- 下载大小: 778425672
- 数据集大小: 1500707584
配置
- 配置名称: 默认
- 数据文件:
- 分割: 训练
- 路径: data/train-*



