five

Corpus of Early English Correspondence Extension Sampler part 1 (CEECES 1)

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/4644243
下载链接
链接失效反馈
官方服务:
资源简介:
The Corpus of Early English Correspondence Extension Sampler part 1 (CEECES 1) is the first public release from the 18th-century part of the Corpora of Early English Correspondence (CEEC-400). The CEECES 1 forms one part of the full CEECES. The other parts, both released 11 April 2022, are the CEECES part 2 and the TCEECES. The CEECES 1 was originally released in April 2021; this new version (11 April 2022) contains the exact same data (corpus files and supporting metadata), only the new zip file has been cleaned of macOS metafiles (__MACOSX and .DS_Store files), the manual has been updated, the metadata is now provided as a tsv file and a MS Excel file, and a key to the metadata has been included in the bundle. See the accompanying manual for more information on the CEECES 1; see the manuals for the CEECES 2 and the TCEECES for more information on the CEECES. See https://varieng.helsinki.fi/CoRD/corpora/CEEC/ for more on the CEEC-400. Citation: CEECES 1 = Corpus of Early English Correspondence Extension Sampler part 1. Compiled by Terttu Nevalainen, Helena Raumolin-Brunberg, Samuli Kaislaniemi, Mikko Laitinen, Minna Nevala, Arja Nurmi, Minna Palander-Collin, Tanja Säily and Anni Sairio at the Department of Languages, University of Helsinki. XML conversion and encoding by Lassi Saario. Helsinki: VARIENG, 2021.   Version history: 1.4.2021 Version 1 – First release of corpus texts, metadata, and manual. 11.4.2022 Version 2 – Second release, to coincide with the release of CEECES 2 and TCEECES. Zip file cleaned, manual updated.
创建时间:
2024-07-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作