Corpus of Historical American English (COHA)

Name: Corpus of Historical American English (COHA)
Creator: UCLA Dataverse
Published: 2023-04-27 16:42:30
License: 暂无描述

DataCite Commons2023-04-27 更新2025-04-16 收录

下载链接：

https://dataverse.ucla.edu/citation?persistentId=doi:10.25346/S6/6I8JL1

下载链接

链接失效反馈

官方服务：

资源简介：

The Corpus of Historical American English (COHA) is the largest structured corpus of historical English. It is related to many other corpora of English that we have created. These corpora were formerly known as the "BYU Corpora", and they offer unparalleled insight into variation in English. If you are interested in historical corpora, you might also look at our Google Books (see comparison), Hansard, and TIME corpora. COHA contains more than 475 million words of text from the 1820s-2010s (which makes it 50-100 times as large as other comparable historical corpora of English) and the corpus is balanced by genre decade by decade. The creation of the corpus results from a grant from the National Endowment for the Humanities (NEH) from 2008-2010. Access to material is limited to UCLA graduate students and faculty. Undergraduates please use the standard web interface for the corpora: https://www.english-corpora.org/coha/

提供机构：

UCLA Dataverse

创建时间：

2021-05-11

搜集汇总

数据集介绍

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集