Annotated files of the Soviet Belarusian newspaper Zviazda (years 1938, 1939, 1940)
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/8248762
Description:
This dataset contains the texts of the Soviet Belarusian newspaper 'Zviazda' for the years 1938, 1939 and 1940, packaged as a ZIP archive.
The data are in CoNLL-U format and therefore carry morphological and syntactic annotation.
The texts were extracted from PDF files by OCR (Tesseract) and annotated with UDPipe. They were produced while exploring a pipeline for processing Belarusian texts, and they are very noisy.
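Because the files are in CoNLL-U, each token sits on one tab-separated line with ten columns, sentences are separated by blank lines, and lines starting with `#` are comments. The following is a minimal sketch of reading such text with the Python standard library only. The function name `parse_conllu` and the three-token `SAMPLE` are illustrative assumptions and are not taken from the archive; lines that do not have exactly ten columns are skipped, which may be a reasonable default given how noisy the OCR output is.

```python
from typing import Dict, List

# The ten column names defined by the CoNLL-U format, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def parse_conllu(text: str) -> List[List[Dict[str, str]]]:
    """Split CoNLL-U text into sentences, each a list of token dicts."""
    sentences: List[List[Dict[str, str]]] = []
    current: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Comment or sentence-level metadata; ignored in this sketch.
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            # Malformed line (assumed here to be OCR or pipeline noise).
            continue
        current.append(dict(zip(FIELDS, cols)))
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample, written for illustration; not a line from the dataset.
SAMPLE = (
    "# text = Гэта газета.\n"
    "1\tГэта\tгэта\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tгазета\tгазета\tNOUN\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)

if __name__ == "__main__":
    for sentence in parse_conllu(SAMPLE):
        print([(tok["form"], tok["upos"]) for tok in sentence])
```

To apply this to the archive itself, the text of each member could be read with the standard `zipfile` module and passed to `parse_conllu`; the member names inside the ZIP are not stated here, so they would need to be listed first.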
Created:
2023-10-10



