
Clipped forms in English

NIAID Data Ecosystem, indexed 2026-03-12
Download link:
https://data.mendeley.com/datasets/2rw5vtz3xh
Description:
The raw data are grouped by the number of syllables in the source word. The truncated item appears in round brackets after its source, as in sergeant (sarge) and body (bod), among many others. No monosyllabic English word is radically reduced, so disyllabic words are the shortest source words subject to shortening. The 750 original words break down into 235 disyllabic, 273 trisyllabic, 150 quadrisyllabic, 70 pentasyllabic, 20 hexasyllabic and 2 heptasyllabic words. Additionally, 39 source words with varied clipped forms are included in the corpus. The sources are Macquarie Dictionary (1984), Minkova (2018), Quirk et al. (1985), Thorndike and Barnhart (1997), Random House Webster’s College Dictionary (2001), and Collins Online Dictionary (2020). Because no data set of English truncated items is freely available to date, the way in which the data were collected has some value in its own right. Whenever a shortened form appeared as a lexical entry in one of the printed sources, it was added to the list of truncated words; nearly every page of the text sources was examined.
Created:
2021-04-15
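
The syllable breakdown given in the description can be checked with a short tally. This is only a sketch: the counts are copied from the description, while the variable names and the percentage calculation are illustrative assumptions and not part of the data set itself.

```python
# Counts of source words per syllable count, copied from the description.
# The dictionary name and layout are illustrative, not the data set's format.
counts_by_syllables = {2: 235, 3: 273, 4: 150, 5: 70, 6: 20, 7: 2}

# Total number of source words across all syllable groups.
total = sum(counts_by_syllables.values())

# Share of each group as a percentage of the total, rounded to one decimal.
shares = {n: round(100 * c / total, 1) for n, c in counts_by_syllables.items()}

print(f"total source words: {total}")
for n, c in counts_by_syllables.items():
    print(f"{n} syllables: {c} words ({shares[n]}%)")
```

The six groups sum to the stated 750 source words, with disyllabic and trisyllabic sources together making up roughly two thirds of the list.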