five

nltk-data-hub/dolch

收藏
Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/nltk-data-hub/dolch
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: dolch data_files: - split: dolch path: data/dolch/dolch.parquet - config_name: dolch-adjectives data_files: - split: dolch path: data/dolch-adjectives/dolch.parquet - config_name: dolch-nouns data_files: - split: dolch path: data/dolch-nouns/dolch.parquet - config_name: dolch-verbs data_files: - split: dolch path: data/dolch-verbs/dolch.parquet - config_name: dolch-adverbs data_files: - split: dolch path: data/dolch-adverbs/dolch.parquet - config_name: dolch-prepositions data_files: - split: dolch path: data/dolch-prepositions/dolch.parquet - config_name: dolch-pronouns data_files: - split: dolch path: data/dolch-pronouns/dolch.parquet - config_name: dolch-conjunctions data_files: - split: dolch path: data/dolch-conjunctions/dolch.parquet license: other task_categories: - token-classification pretty_name: NLTK Dolch Sight Word List --- # NLTK Dolch Sight Word List The 315 Dolch sight words (Dolch 1936), grouped by part of speech, distributed via [NLTK](https://www.nltk.org/). ## Configs | Config | Words | Schema | |---|---|---| | `dolch` | 315 | `word, pos` | | `dolch-adjectives` | 46 | `word` | | `dolch-nouns` | 95 | `word` | | `dolch-verbs` | 92 | `word` | | `dolch-adverbs` | 34 | `word` | | `dolch-prepositions` | 16 | `word` | | `dolch-pronouns` | 26 | `word` | | `dolch-conjunctions` | 6 | `word` | ## Schema **`dolch`** — combined list with part-of-speech | Column | Type | Description | |---|---|---| | `word` | string | The sight word | | `pos` | string | Part of speech (adjectives, nouns, verbs, …) | **`dolch-*`** — word only | Column | Type | Description | |---|---|---| | `word` | string | The sight word | ## Usage ```python from datasets import load_dataset ds = load_dataset("nltk-data-hub/dolch", "dolch") # all 315, with pos ds = load_dataset("nltk-data-hub/dolch", "dolch-verbs") # verbs only ``` ## Via NLTK ```python import nltk nltk.download("words", hf=True) nltk.corpus.words.words("dolch") # 315 Dolch sight words nltk.corpus.words.words("dolch-verbs") # 92 verbs nltk.corpus.words.words("dolch-nouns") # 95 nouns ``` ## License Public domain — Dolch (1936), published work now in the public domain. ## Citation ```bibtex @article{dolch, author = {Dolch, Edward William}, title = {A Basic Sight Vocabulary}, journal = {The Elementary School Journal}, volume = {36}, number = {6}, pages = {456--460}, year = {1936}, doi = {10.1086/457353} } ```
提供机构:
nltk-data-hub
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作