five

ColloCaid Sample Data

收藏
figshare.com2023-05-30 更新2025-03-25 收录
下载链接:
https://figshare.com/articles/dataset/ColloCaid_Sample_Data/13028207/2
下载链接
链接失效反馈
官方服务:
资源简介:
COLLOCAID SAMPLE DATAThe ColloCaid Sample Data comprises approximately 2% of the ColloCaid lexical database. The sample covers 692 strong academic English collocations (LogDice >5.0) for 16 core academic lemmas used as collocation bases (or nodes): 5 nouns, 5 verbs, and 6 adjectives. The selection aims to give an overview of the range of data included in the full dataset. This includes collocations with bases classified with more than one part-of-speech tag (e.g. DEBATE, INDIVIDUAL), polysemous collocation bases giving rise to distinct collocation patterns (e.g. CODE), as well as collocation bases that evoke a very large and a very small number of collocations. The strongest eight lexical collocations listed for each base are enriched with three different curated example sentences adapted from corpora of expert academic English writing. COLLOCAID LEXICAL DATA 1.1The full ColloCaid lexical dataset consists of:• 572 core academic English lemmas (311 nouns, 184 verbs and 77 adjectives)• 32,645 academic collocations with the above lemmas• 29,028 example sentences of collocations in context Further information at http://www.collocaid.uk/

COLLOCAID 样本数据集 数据集约占 ColloCaid 词汇数据库的 2%,涵盖了 692 个强学术英语搭配(LogDice >5.0),这些搭配以 16 个核心学术词素作为搭配基础(或节点):5 个名词、5 个动词和 6 个形容词。该样本的选取旨在提供一个全面数据集内容的概览。其中包括被归类为多个词性标签的搭配基础(例如:DEBATE,INDIVIDUAL)、具有多种搭配模式的同义搭配基础(例如:CODE),以及能够激发大量或少量搭配的搭配基础。对于每个基础,列出了最强的八个词汇搭配,并增添了三个经过精心挑选的例句,这些例句取自专家学术英语写作的语料库。 ColloCaid 词汇数据集 1.1 完整版包括: • 572 个核心学术英语词素(311 个名词、184 个动词和 77 个形容词) • 与上述词素相关的 32,645 个学术搭配 • 29,028 个搭配语境示例句 更多详细信息请访问 http://www.collocaid.uk/
提供机构:
figshare.com
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作