five

Ukrainian Epigraphic Corpus: Academic and Web-Based Texts (20th–21st Century)

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14795103
下载链接
链接失效反馈
官方服务:
资源简介:
Title:Ukrainian Epigraphic Corpus: Academic and Web-Based Texts (20th–21st Century) Description:This dataset comprises a corpus of Ukrainian epigraphic texts collected from academic publications, conference proceedings, and web-based sources. The corpus is designed to support linguistic analysis, term extraction, and the development of a Simple Knowledge Organization System (SKOS) vocabulary for Ukrainian epigraphy. The corpus includes 292 documents with over 1.29 million tokens and 778,104 words, reflecting a comprehensive linguistic and historical representation of Ukrainian inscriptions. Texts span from the second half of the 20th century to 2024 and cover diverse regions within Ukraine, such as Kyiv, Halychyna, and Chernihiv. The sources range from books and monographs to web-based epigraphic discussions, ensuring both academic rigor and contemporary relevance. Data processing was conducted using Sketch Engine, including tokenization, lemmatization, and part-of-speech tagging to facilitate accurate term identification and frequency analysis. This corpus is particularly valuable for researchers in epigraphy, linguistics, digital humanities, and terminology studies. Corpus Structure: Academic Sub-Corpus: 18 documents (books, articles, encyclopedic entries) Web Sub-Corpus: 274 documents (web articles, blogs, project websites) License:This dataset is released under the CC BY 4.0 license, allowing for reuse and adaptation with proper attribution. Citation:Ukrainian Epigraphic Text Corpus (2024). Available at Zenodo: [Insert DOI]
创建时间:
2025-02-03
二维码
社区交流群
二维码
科研交流群
商业服务