five

Corpus of metaphorical expressions in spoken Slovene language G-KOMET 1.0

收藏
hdl.handle.net2025-03-26 收录
下载链接:
http://hdl.handle.net/11356/1490
下载链接
链接失效反馈
官方服务:
资源简介:
G-KOMET (a corpus of metaphorical expressions in spoken Slovene language) is an upgrade of the hand-annotated written corpus for metaphorical expressions KOMET (http://hdl.handle.net/11356/1293) with transcriptions of speech and conversation that covers 50,000 lexical units. The corpus contains samples from the Gos corpus of spoken Slovene (http://hdl.handle.net/11356/1438) and includes a balanced set of transcriptions of informative, educational, entertaining, private, and public discourse. It contains hand-annotated metaphor-related words, i.e. linguistic expressions that have the potential for people to interpret them as metaphors, idioms, i.e. multi-word units in which at least one word has been used metaphorically, and metonymies, expressions that we use to express something else. The annotation scheme was based on the MIPVU metaphor identification process. This protocol was modified and adapted to the specifics of the Slovene language and the specifics of the spoken language. Corpus was annotated for the following relations to metaphor: indirect metaphor, direct metaphor, borderline cases and metaphor signals. In addition, the corpus introduces a new ‘frame’ tag, which gives information about a concept to which it refers. This conceptual frame allows us to search for figurative expressions within a specific context category (e.g. time, spatial orientation, emotions etc.). Metonymies were furthermore categorized based on the specific metonymic mapping. Corpus of metaphorical expressions in spoken Slovene language G-KOMET allows an objective and systematic analysis of metaphorical expressions, metaphors and metonymies in various Slovene texts.

G-KOMET(斯洛文尼亚口语隐喻表达语料库)是对手工标注的书面隐喻表达语料库KOMET(http://hdl.handle.net/11356/1293)的升级,其中包含了语音和对话的转录,涵盖了50,000个词汇单位。该语料库汇集了来自Gos斯洛文尼亚口语语料库(http://hdl.handle.net/11356/1438)的样本,并包含了一系列平衡的转录,涉及信息性、教育性、娱乐性、私密性和公共话语。它包含手工标注的与隐喻相关的词汇,即具有被人们解读为隐喻、成语(即至少有一个词被隐喻使用的多词单位)和转喻的潜在的语言表达,转喻是用于表达其他事物的表达。标注方案基于MIPVU隐喻识别流程,该协议经过修改和调整,以适应斯洛文尼亚语言和口语的具体特点。语料库针对以下与隐喻的关系进行了标注:间接隐喻、直接隐喻、边界案例和隐喻信号。此外,语料库引入了一个新的‘框架’标签,它提供了关于所指概念的信息。这一概念框架使我们能够在特定的语境类别(例如时间、空间方向、情感等)内搜索比喻性表达。转喻还根据特定的转喻映射进行了分类。斯洛文尼亚口语隐喻表达语料库G-KOMET允许对各种斯洛文尼亚文本中的隐喻表达、隐喻和转喻进行客观和系统的分析。
提供机构:
hdl.handle.net
二维码
社区交流群
二维码
科研交流群
商业服务