ijazulhaq/pashto_corpus
收藏Hugging Face2023-03-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ijazulhaq/pashto_corpus
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
language:
- ps
tags:
- pashto corpus
- pashto pos tagging
- pashto word segmentation
- pashto tokenization
- pashto ner
- sentiment analysis in pashto
pretty_name: The Pashto Corpus
size_categories:
- 1M<n<10M
---
# Dataset Card for Pashto Corpus
## Dataset Description
- **Homepage:**
- **Repository:**
- **Paper:**
- **Leaderboard:**
- **Point of Contact:**
### Dataset Summary
This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1).
### Supported Tasks and Leaderboards
[More Information Needed]
### Languages
[More Information Needed]
## Dataset Structure
### Data Instances
[More Information Needed]
### Data Fields
[More Information Needed]
### Data Splits
[More Information Needed]
## Dataset Creation
### Curation Rationale
[More Information Needed]
### Source Data
#### Initial Data Collection and Normalization
[More Information Needed]
#### Who are the source language producers?
[More Information Needed]
### Annotations
#### Annotation process
[More Information Needed]
#### Who are the annotators?
[More Information Needed]
### Personal and Sensitive Information
[More Information Needed]
## Considerations for Using the Data
### Social Impact of Dataset
[More Information Needed]
### Discussion of Biases
[More Information Needed]
### Other Known Limitations
[More Information Needed]
## Additional Information
### Dataset Curators
[More Information Needed]
### Licensing Information
[More Information Needed]
### Citation Information
[More Information Needed]
### Contributions
[More Information Needed]
提供机构:
ijazulhaq
原始信息汇总
Pashto Corpus 数据集概述
数据集描述
- 数据集名称: The Pashto Corpus
- 语言:
- Pashto (ps)
- 标签:
- Pashto Corpus
- Pashto POS Tagging
- Pashto Word Segmentation
- Pashto Tokenization
- Pashto NER
- Sentiment Analysis in Pashto
- 许可证: MIT
- 大小: 1M<n<10M
数据集结构
数据实例
[信息缺失]
数据字段
[信息缺失]
数据分割
[信息缺失]
数据集创建
数据筛选理由
[信息缺失]
源数据
初始数据收集和标准化
[信息缺失]
源语言生产者
[信息缺失]
注释
注释过程
[信息缺失]
注释者
[信息缺失]
个人和敏感信息
[信息缺失]
使用数据时的考虑
数据集的社会影响
[信息缺失]
讨论偏见
[信息缺失]
其他已知限制
[信息缺失]
附加信息
数据集维护者
[信息缺失]
许可信息
[信息已提供]
- 许可证: MIT
引用信息
[信息缺失]
贡献
[信息缺失]



