
A Full Morphosyntactic Annotation of the State Archives of Assyria Letter Corpus

Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/10622983
Description:
The dataset consists of a full morphosyntactic annotation of the normalized letter corpus of the State Archives of Assyria online (SAAo), plus associated metadata on sender, recipient, estimated date of composition, script, and dialect of Akkadian (where determinable). The corpus comprises ten of the twenty-one current volumes of SAAo and contains approximately 2,600 letters from the royal archives of the late Neo-Assyrian kings. Each letter carries morphosyntactic annotations specifying the part of speech, lemma, morphological decomposition, and syntactic dependencies of all relevant tokens in the text. The annotations were produced with the help of a spaCy language model, followed by human checking and completion. They are available both as a set of CoNLL-U files (one per text) and as linked open data in a single TTL file. The associated metadata is available as a CSV file and a TTL file. Because the letters share a format, topics of concern, and historical period, the corpus forms a natural object of study from both linguistic and social-historical perspectives. It is hoped that the data will be of use to researchers doing linguistic and sociolinguistic corpus research on these texts.
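As a rough illustration of how the per-text CoNLL-U files might be read, the sketch below parses the standard ten tab-separated CoNLL-U columns using only the Python standard library and tallies part-of-speech tags. The names `parse_conllu`, `upos_counts`, and `SAMPLE` are hypothetical, and the two sample token lines are invented for demonstration; they are not taken from the dataset, whose files may also carry comments or features not shown here.

```python
from collections import Counter

# The ten standard CoNLL-U columns, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def parse_conllu(text):
    """Return a list of sentences, each a list of token dicts."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Comment lines (e.g. sent_id) are skipped in this sketch.
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            # Skip lines that do not have exactly ten columns.
            continue
        current.append(dict(zip(FIELDS, cols)))
    if current:
        sentences.append(current)
    return sentences


def upos_counts(sentences):
    """Count universal part-of-speech tags across all tokens."""
    return Counter(tok["upos"] for sent in sentences for tok in sent)


# Invented sample, NOT from the dataset: two tokens with placeholder values.
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "\t".join(["1", "ana", "ana", "ADP", "_", "_", "2", "case", "_", "_"]),
    "\t".join(["2", "šarri", "šarru", "NOUN", "_", "_", "0", "root", "_", "_"]),
    "",
])

if __name__ == "__main__":
    sents = parse_conllu(SAMPLE)
    for tok in sents[0]:
        print(tok["form"], tok["lemma"], tok["upos"], tok["head"], tok["deprel"])
    print(upos_counts(sents))
```

To apply this to a downloaded file, the text passed to `parse_conllu` would come from opening that file with UTF-8 encoding; a dedicated CoNLL-U library may be a better choice for multiword tokens and feature parsing.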
Created:
2024-02-07