gustawdaniel/ngram-google-2012
收藏Hugging Face2023-04-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/gustawdaniel/ngram-google-2012
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-3.0
---
```
python -m spacy download en_core_web_sm
```
Titles:
```
jq -s '.[].title' raw/dict.jsonl
```
returns
- [x] "English"
- [ ] "English One Million"
- [x] "American English"
- [x] "British English"
- [x] "English Fiction"
- [ ] "Chinese (simplified)"
- [x] "French"
- [x] "German"
- [ ] "Hebrew"
- [ ] "Italian"
- [x] "Russian"
- [x] "Spanish"
Spellcheck:
https://pypi.org/project/pyspellchecker/
```
English - ‘en’
Spanish - ‘es’
French - ‘fr’
Portuguese - ‘pt’
German - ‘de’
Russian - ‘ru’
Arabic - ‘ar’
```
Sets now:
- [x] "English" - en
- [x] "Spanish" - es
- [x] "French" - fr
- [x] "German" - de
- [x] "Russian" - ru
提供机构:
gustawdaniel
原始信息汇总
数据集概述
数据集名称
- "English"
- "American English"
- "British English"
- "English Fiction"
- "French"
- "German"
- "Russian"
- "Spanish"
数据集语言代码
- "English" - en
- "Spanish" - es
- "French" - fr
- "German" - de
- "Russian" - ru
许可证
- cc-by-3.0



