IceMorph morphological analysis data files
收藏NIAID Data Ecosystem2026-03-08 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5068%252FD1WC7K
下载链接
链接失效反馈官方服务:
资源简介:
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic parsed for word class and inflectional detail; a corpus of Old Icelandic sagas in plain text and chunked by chapter; a tagged version of the same text, output of the IceMorph system; a training corpus labeled "Expert" for training and testing a machine learning module; and a training corpus labeled "Gold" for training and testing a machine learning module.
Methods
Datasets (1) dictionary (2a) saga texts were generated using OCR. Dataset (2b) is the output of the IceMorph tagging system. Datasets (3a) and (3b) were generated by hand-tagging.
创建时间:
2014-06-09



