
Annotated files of the Soviet Belarusian newspaper Zviazda (years 1938, 1939, 1940)

Indexed by NIAID Data Ecosystem on 2026-05-01
Download link:
https://zenodo.org/record/8248762
Description:
This dataset contains the texts of the Soviet Belarusian newspaper 'Zviazda' for the years 1938, 1939 and 1940 as a ZIP archive. The data are in CoNLL-U format and are therefore morphologically and syntactically annotated. They were generated from PDF files through OCR (Tesseract) and annotated with UDPipe. The data were produced while exploring a pipeline for processing Belarusian texts, and they are very noisy.
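Since the files are in CoNLL-U format, they can be read with a few lines of plain Python. The following is a minimal sketch, not part of the dataset: the function name `read_conllu` and the three-token sample sentence are illustrative assumptions, and the layout of files inside the ZIP archive is not described in the record, so the reader works on any iterable of lines. Because the data are noisy, lines that do not have exactly ten tab-separated columns are skipped rather than raising an error.

```python
from typing import Dict, Iterable, Iterator, List

# The ten standard CoNLL-U columns, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def read_conllu(lines: Iterable[str]) -> Iterator[List[Dict[str, str]]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Dict[str, str]] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Sentence-level comment (e.g. "# text = ..."); ignored here.
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            # Malformed line (plausible in noisy OCR output): skip it.
            continue
        sentence.append(dict(zip(FIELDS, cols)))
    if sentence:
        # The last sentence may lack a trailing blank line.
        yield sentence


# Hypothetical sample, made up for illustration (not taken from the dataset).
sample = (
    "# text = Гэта газета .\n"
    "1\tГэта\tгэта\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tгазета\tгазета\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)

sentences = list(read_conllu(sample.splitlines()))
for token in sentences[0]:
    print(token["form"], token["upos"], token["deprel"])
```

For a real file, the same function can be fed an open text file handle (opened with `encoding="utf-8"`), since a file object is also an iterable of lines.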
Created:
2023-10-10