ColloCaid Sample Data
收藏figshare.com2023-05-30 更新2025-03-25 收录
下载链接:
https://figshare.com/articles/dataset/ColloCaid_Sample_Data/13028207/2
下载链接
链接失效反馈官方服务:
资源简介:
COLLOCAID SAMPLE
DATAThe ColloCaid Sample
Data comprises approximately 2% of the ColloCaid lexical database. The sample
covers 692 strong academic English collocations (LogDice >5.0) for 16 core
academic lemmas used as collocation bases (or nodes): 5 nouns, 5 verbs, and 6
adjectives. The selection aims to give an overview of the range of data
included in the full dataset. This includes collocations with bases classified
with more than one part-of-speech tag (e.g. DEBATE, INDIVIDUAL), polysemous
collocation bases giving rise to distinct collocation patterns (e.g. CODE), as
well as collocation bases that evoke a very large and a very small number of
collocations. The strongest eight lexical collocations listed for each base are
enriched with three different curated example sentences adapted from corpora of
expert academic English writing. COLLOCAID LEXICAL
DATA 1.1The full ColloCaid
lexical dataset consists of:• 572 core academic
English lemmas (311
nouns, 184 verbs and 77 adjectives)• 32,645 academic
collocations with the above lemmas• 29,028 example
sentences of collocations in context
Further information at http://www.collocaid.uk/
COLLOCAID 样本数据集
数据集约占 ColloCaid 词汇数据库的 2%,涵盖了 692 个强学术英语搭配(LogDice >5.0),这些搭配以 16 个核心学术词素作为搭配基础(或节点):5 个名词、5 个动词和 6 个形容词。该样本的选取旨在提供一个全面数据集内容的概览。其中包括被归类为多个词性标签的搭配基础(例如:DEBATE,INDIVIDUAL)、具有多种搭配模式的同义搭配基础(例如:CODE),以及能够激发大量或少量搭配的搭配基础。对于每个基础,列出了最强的八个词汇搭配,并增添了三个经过精心挑选的例句,这些例句取自专家学术英语写作的语料库。
ColloCaid 词汇数据集 1.1 完整版包括:
• 572 个核心学术英语词素(311 个名词、184 个动词和 77 个形容词)
• 与上述词素相关的 32,645 个学术搭配
• 29,028 个搭配语境示例句
更多详细信息请访问 http://www.collocaid.uk/
提供机构:
figshare.com



