IceMorph morphological analysis data files
收藏DataCite Commons2026-04-10 更新2026-04-25 收录
下载链接:
https://datadryad.org/dataset/doi:10.5068/D1WC7K
下载链接
链接失效反馈官方服务:
资源简介:
This dataset consists of four main resources: a concatenated dictionary of
Old Icelandic parsed for word class and inflectional detail; a corpus of
Old Icelandic sagas in plain text and chunked by chapter; a tagged version
of the same text, output of the IceMorph system; a training corpus labeled
"Expert" for training and testing a machine learning module; and
a training corpus labeled "Gold" for training and testing a
machine learning module.
本数据集包含四大核心资源:其一为经词性标注与屈折细节解析的古冰岛语拼接词典;其二为采用纯文本格式且按章节分块的古冰岛萨迦语料库;其三为该文本的标注版本,由IceMorph系统生成;其四为分别标注为"Expert"与"Gold"的机器学习模块训练与测试语料库。
提供机构:
Dryad
创建时间:
2014-06-09



