five

Dataset of Middle Dutch lexical stress patterns and syllabifications

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/2582975
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of 48.219 Middle Dutch words taken from in total 205 rhymed texts of the Cd-rom Middelnederlands (1998). All of these words have been assigned a syllabification and lexical stress pattern. E.g.: proevede is syllabified as proe-ve-de and has a stress index set at -3, which means that – counting from the rightmost syllable – the third syllable receives stress. This upload contains the following files: The JSON-file (compressed), which was used as input data for a machine learning algorithm trained for the automatic syllabification and stress assignment of Middle Dutch polysyllabic words (for the code of this experiment, see GitHub) An Excel-file, containing the same data as the JSON (for more convenient reference) A split file (compressed), used in the training proces of the above-mentioned experiment A pdf-file with some insightful illustrations about the contents of the dataset This dataset is part of the research of Wouter Haverals (FWO, University of Antwerp), carried out under the supervision of prof. Mike Kestemont and em. prof. Frank Willaert.
创建时间:
2024-07-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作