Weak lexical equivalents in Russian → Belarusian translation: approaches to identification (dataset)
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8371719
下载链接
链接失效反馈官方服务:
资源简介:
Data accompanying the talk:
А. А. Ваўчок, У. В. Парыцкі. Слабыя лексічныя адпаведнікі ў руска-беларускім перакладзе: падыходы да ідэнтыфікацыі // XI Міжнародны Кангрэс даследчыкаў Беларусі, Гданьск, 23.09.2023 [Oksana Volchek, Vladislav Poritski. Weak lexical equivalents in Russian → Belarusian translation: approaches to identification // Presented at 11th International Congress of Belarusian Studies, Gdańsk, 23.09.2023]
The file data.csv is a test battery of 100 contexts in Russian, expected to cause difficulties when attempting to translate them into Belarusian. This is because each context contains a pair of words distinct in Russian but lacking clearly distinct Belarusian lexical equivalents. The contexts were taken, often with some simplifications, from various sources available on the web, i.e. they are representative of real usage, not artificially constructed.
The file has four columns:
w1, w2 – two words belonging to the same part of speech (typically adjective, noun, or verb), w1 lexicographically preceding w2;
context – a snippet of text with both words (typically a single sentence, sometimes longer);
inspired_by – URL of a web page or file containing the original version of the context, which may have been slightly abridged or reworded for illustrative purposes. Archive copies of most URLs, with the exception of Google Books links, are available in the Wayback Machine (https://web.archive.org) or in https://archive.today.
创建时间:
2023-09-23



