nltk-data-hub/dolch
收藏Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/nltk-data-hub/dolch
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: dolch
data_files:
- split: dolch
path: data/dolch/dolch.parquet
- config_name: dolch-adjectives
data_files:
- split: dolch
path: data/dolch-adjectives/dolch.parquet
- config_name: dolch-nouns
data_files:
- split: dolch
path: data/dolch-nouns/dolch.parquet
- config_name: dolch-verbs
data_files:
- split: dolch
path: data/dolch-verbs/dolch.parquet
- config_name: dolch-adverbs
data_files:
- split: dolch
path: data/dolch-adverbs/dolch.parquet
- config_name: dolch-prepositions
data_files:
- split: dolch
path: data/dolch-prepositions/dolch.parquet
- config_name: dolch-pronouns
data_files:
- split: dolch
path: data/dolch-pronouns/dolch.parquet
- config_name: dolch-conjunctions
data_files:
- split: dolch
path: data/dolch-conjunctions/dolch.parquet
license: other
task_categories:
- token-classification
pretty_name: NLTK Dolch Sight Word List
---
# NLTK Dolch Sight Word List
The 315 Dolch sight words (Dolch 1936), grouped by part of speech, distributed
via [NLTK](https://www.nltk.org/).
## Configs
| Config | Words | Schema |
|---|---|---|
| `dolch` | 315 | `word, pos` |
| `dolch-adjectives` | 46 | `word` |
| `dolch-nouns` | 95 | `word` |
| `dolch-verbs` | 92 | `word` |
| `dolch-adverbs` | 34 | `word` |
| `dolch-prepositions` | 16 | `word` |
| `dolch-pronouns` | 26 | `word` |
| `dolch-conjunctions` | 6 | `word` |
## Schema
**`dolch`** — combined list with part-of-speech
| Column | Type | Description |
|---|---|---|
| `word` | string | The sight word |
| `pos` | string | Part of speech (adjectives, nouns, verbs, …) |
**`dolch-*`** — word only
| Column | Type | Description |
|---|---|---|
| `word` | string | The sight word |
## Usage
```python
from datasets import load_dataset
ds = load_dataset("nltk-data-hub/dolch", "dolch") # all 315, with pos
ds = load_dataset("nltk-data-hub/dolch", "dolch-verbs") # verbs only
```
## Via NLTK
```python
import nltk
nltk.download("words", hf=True)
nltk.corpus.words.words("dolch") # 315 Dolch sight words
nltk.corpus.words.words("dolch-verbs") # 92 verbs
nltk.corpus.words.words("dolch-nouns") # 95 nouns
```
## License
Public domain — Dolch (1936), published work now in the public domain.
## Citation
```bibtex
@article{dolch,
author = {Dolch, Edward William},
title = {A Basic Sight Vocabulary},
journal = {The Elementary School Journal},
volume = {36},
number = {6},
pages = {456--460},
year = {1936},
doi = {10.1086/457353}
}
```
提供机构:
nltk-data-hub



