ICE Nigeria
DataCite Commons · Updated 2026-02-19 · Indexed 2024-07-13
Download link:
https://datastore.uni-muenster.de/doi/10.17879/26968583686
Description:
This is the Nigerian component of the International Corpus of English, a one-million-word corpus of written and spoken Nigerian English for linguistic research. It can be used as a stand-alone corpus or in conjunction with other components of the International Corpus of English (such as ICE-GB, ICE-India, etc.) to compare international varieties of English.
The corpus consists of several parts. The written part is available as text files, XML files, and XML files with part-of-speech tagging. For the spoken part, the EAF files (ELAN files in XML format) are available together with the text files. The corresponding sound files can be downloaded in a separate file.
In addition, we provide the corpus manual as well as a spreadsheet with metadata (speaker age, gender, ethnic group and profession) and XML specifications.
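Because the spoken part is distributed as EAF files, which are plain XML, the time-aligned transcriptions can be read with a standard XML parser. The following is a minimal sketch, assuming the files follow the usual ELAN layout (`TIME_SLOT` elements under `TIME_ORDER`, and `ALIGNABLE_ANNOTATION` elements inside each `TIER`). The sample document, the tier name `Speaker1`, and the function name `read_eaf_segments` are hypothetical and are not taken from the corpus; the corpus manual and XML specifications are the authority on the actual tier structure.

```python
import xml.etree.ElementTree as ET

# Hypothetical, hand-written EAF fragment for illustration only;
# it is NOT taken from ICE Nigeria and the tier name is an assumption.
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
    <TIME_SLOT TIME_SLOT_ID="ts3" TIME_VALUE="3200"/>
  </TIME_ORDER>
  <TIER TIER_ID="Speaker1">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
          TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>good morning</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2"
          TIME_SLOT_REF1="ts2" TIME_SLOT_REF2="ts3">
        <ANNOTATION_VALUE>how are you</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>
"""


def read_eaf_segments(root):
    """Return (tier_id, start_ms, end_ms, text) tuples from an EAF root element."""
    # Map each time slot ID to its offset in milliseconds.
    # Slots without a TIME_VALUE (unaligned) are skipped.
    slots = {
        slot.get("TIME_SLOT_ID"): int(slot.get("TIME_VALUE"))
        for slot in root.iter("TIME_SLOT")
        if slot.get("TIME_VALUE") is not None
    }
    segments = []
    for tier in root.iter("TIER"):
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            text = ann.findtext("ANNOTATION_VALUE", default="") or ""
            start = slots.get(ann.get("TIME_SLOT_REF1"))
            end = slots.get(ann.get("TIME_SLOT_REF2"))
            segments.append((tier.get("TIER_ID"), start, end, text.strip()))
    return segments


segments = read_eaf_segments(ET.fromstring(SAMPLE_EAF))
for tier_id, start, end, text in segments:
    print(tier_id, start, end, text)
```

To apply the same function to a downloaded file, the root element can be obtained with `ET.parse(path).getroot()`, where `path` points to one of the `.eaf` files.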
Provider:
University of Münster
Created:
2024-05-23



