Medieval Hebrew Poetry Metaphor Detection Dataset
Source: arXiv | Updated: 2024-02-27 | Indexed: 2024-07-23
Download link:
https://tokeron.github.io/metaphor/
Description:
The Medieval Hebrew Poetry Metaphor Detection Dataset was created by researchers at the Technion – Israel Institute of Technology and The Open University of Israel. It covers Piyyut, Hebrew liturgical poetry from the 5th to 8th centuries CE. The dataset contains 309 poems totaling 73,179 words, with metaphorical expressions annotated by experts. Although relatively small, it accounts for 15% of digitized Piyyut and is the only metaphor-annotated corpus in Hebrew. The poems were reconstructed from ancient manuscripts from the Cairo Genizah, then digitized and annotated with the CATMA tool. The dataset is intended mainly for literary and linguistic research, helping scholars and non-specialists understand and analyze metaphor use in medieval Hebrew poetry and, through it, the literature and language of the period.
Provider:
Technion – Israel Institute of Technology
Created:
2024-02-27



