Replication Data for: The many faces of "možno" in Russian and across Slavic. Corpus investigation of constructions with the modal možno (Chapter 3)
Source: DataONE. Updated: 2024-08-21. Indexed: 2025-04-26.
Download link:
https://search.dataone.org/view/sha256:f2a3a52c4332763f63be670602687d02c6a3a1ea7ba282cce0eab73fdd83eed0
Description:
This dataset contains data from the Main corpus of the Russian National Corpus (RNC, ruscorpora.ru) used for the analysis in Chapter 3 of the Introductory Chapter of the doctoral dissertation "The many faces of 'možno' in Russian and across Slavic. Corpus investigation of constructions with the modal možno". Chapter 3 presents a study of 500 examples of Russian constructions with the modal word možno 'can, be possible'. The query consisted of the single word možno, with no time period specified. The search returned 361,755 examples; 5,000 of these were downloaded in .xlsx format and pseudorandomized, and the first 500 were then extracted for analysis. The spreadsheet 01DataTheManyFacesOfMozno comprises these 500 examples. The data was collected from the RNC in March 2023. All examples are annotated by hand for semantics and syntax, based on the syntactic analyses given in the corpus. The syntactic and morphological categories used in the corpus are explained at https://ruscorpora.ru/corpus/main.
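The sampling step described above (pseudorandomize the 5,000 downloaded examples, then keep the first 500) can be illustrated with a minimal sketch. The record does not say which tool, method, or seed was used for the pseudorandomization, so the seeded shuffle, the function name `pseudorandom_subsample`, and the placeholder example IDs below are all assumptions made for illustration, not the author's actual procedure.

```python
import random


def pseudorandom_subsample(rows, n=500, seed=0):
    """Shuffle a copy of `rows` with a seeded PRNG and return the first `n`.

    The seed value is an assumption; the dataset record does not report one.
    """
    shuffled = list(rows)                  # copy, so the input order is untouched
    random.Random(seed).shuffle(shuffled)  # deterministic for a given seed
    return shuffled[:n]


# Placeholder stand-in for the 5,000 downloaded examples (hypothetical IDs,
# not real corpus data).
downloaded = [f"example_{i}" for i in range(5000)]
sample = pseudorandom_subsample(downloaded, n=500, seed=0)
print(len(sample), len(set(sample)))
```

Fixing the seed makes the subsample reproducible: running the same shuffle over the same 5,000 rows yields the same 500 examples each time.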
Created:
2024-09-25



