five

Corpus of Global Web Based English (GloWbE)

收藏
DataCite Commons2024-08-16 更新2025-04-16 收录
下载链接:
https://datasets.lib.berkeley.edu/citation?persistentId=doi:10.60503/D3/FWOSXY
下载链接
链接失效反馈
官方服务:
更多采购需求
资源简介:
The corpus of Global Web-based English (GloWbE; pronounced "globe") is unique in the way that it allows you to carry out comparisons between different varieties of English. GloWbE is related to other corpora from English-Corpora.org, which are the most widely used corpora of English, and which offer unparalleled insight into variation in English. GloWbE contains about 1.9 billion words of text from twenty different countries. This makes it about 100 times as large as other corpora like the International Corpus of English, and it allows for many types of searches that would not be possible otherwise. In addition to this online interface, you can also download full-text data from the corpus. Click on any of the links in the search form to the left for context-sensitive help. You might pay special attention to the comparisons between countries and virtual corpora, which allow you to create personalized collections of texts related to a particular area of interest. English-Corpora: GloWbE
提供机构:
UC Berkeley Library Dataverse
创建时间:
2024-08-16

社区讨论

【我遇到的问题】 • 现象:该数据集的下载链接已失效 【相关信息】 • 可考虑访问这个链接获取类似文件~https://www.selectdataset.com/dataset/3688356173feccbcf1f1e490ddc6bc72

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作