Replication data for: Old Church Slavonic byti Part One and Part Two
收藏DataONE2022-07-18 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:1a9f0906fd1aebee549d5a6b141ff4ed1ed0cb6a7ed5995b781df6faa573c1ae
下载链接
链接失效反馈官方服务:
资源简介:
Abstract Part One. There is controversy over whether byti ‘be’ in Old Church Slavonic functioned as an imperfective verb with an unusually large number of inflected forms or as an aspectual pair of verbs, reflecting its suppletive origin from two stems (es- and bū-). We offer an objective empirical approach to the status of this verb, using statistical analysis of 2,428 attestations of byti in comparison with 9,694 attestations of 129 other verbs. This makes it possible to accurately locate byti in the context of the verbal lexicon of Old Church Slavonic. The comparison is made via grammatical profiles, a method that examines the frequency distribution of each verb’s inflected forms. This comparison is undertaken in two rounds, one assuming that byti is a single verb, and the other assuming that it is a pair of verbs. Both assumptions yield reasonable results, and although the grammatical profile analyses do not suffice to solve the controversy, they lay the groundwork for further analysis in Part Two that argues for a single-verb interpretation of byti. Data and R Scripts Part One: The Dat a Our analysis uses two datasets, one that presents the forms of byti as a single paradigm, verbs.csv, and one that presents it as a pair of verbs, splitverbs.csv. The R Scripts In order to represent the Church Slavonic orthography, you will need our transliteration script: translit.r. This script is sourced by the scripts for our analysis which present byti as either a single verb or a verb pair: PartOneSingleVerb.r and PartOneVerbPair.r . This script performs all of the steps for the analysis in our article and generates the plots.
Abstract Part Two: The verb byti ‘be’ in Old Church Slavonic appears in an unusually rich inventory of grammatical constructions that it appears in. We analyze corpus data on the distribution of constructions in order to assess the status of this verb as either a single verb or an aspectual pair of verbs. Our study moves beyond a strict structuralist interpretation of the behavior of byti, instead recognizing the real variation and ambiguity in the data. Our findings make both theoretical and descriptive advances. The radial category structure is a central tenet of cognitive linguistics, but until now such structures have usually been posited by researchers based on their qualitative insights from data. We show that it is possible to identify both the nodes and the structure of a radial category statistically, using only linguistic data as input. We provide an enhanced description of byti that clearly distinguishes between core uses and those that are more peripheral and shows the relationships among them. While we find some evidence in support of an aspectual pair, most evidence points instead toward a single verb. Data and R Script Part Two: The Data The dataset used in this analysis is frames.csv. The R Script The R script used in this analysis is PartTwo.r.
第一部分 摘要
关于古教会斯拉夫语(Old Church Slavonic)中的动词byti(意为“是”),学界始终存在争议:其究竟是拥有异常丰富变位形式的非完成体动词(imperfective verb),还是一对体对立动词(aspectual pair)?这一争议源于其异干互补(suppletive)词源——源自es-与bū-两个词干。我们采用客观的实证研究方法探究该动词的地位:通过对2428条byti的语料记录与129个其他动词共9694条语料记录进行统计分析,得以将byti精准定位至古教会斯拉夫语动词词汇系统的语境之中。
本次比较采用语法轮廓(grammatical profiles)分析法,该方法会考察每个动词变位形式的频率分布情况。比较分为两轮开展:一轮假设byti为单一动词,另一轮假设其为一对体对立动词。两种假设均得到了合理的结果;尽管语法轮廓分析法尚不足以解决这一争议,但它们为第二部分的进一步分析奠定了基础——第二部分将论证byti应被视为单一动词。
第一部分 数据与R脚本
数据集:本分析使用两类数据集:一类将byti的变位形式作为单一范式呈现,对应文件为verbs.csv;另一类将其拆分为一对体对立动词,对应文件为splitverbs.csv。
R脚本:为正确呈现教会斯拉夫语正字法,需使用我们提供的转写脚本translit.r。本分析中用于呈现byti为单一动词或动词对的脚本均会调用该转写脚本,分别为PartOneSingleVerb.r与PartOneVerbPair.r。该脚本可完成本文分析的全部步骤,并生成相关图表。
第二部分 摘要
古教会斯拉夫语中的动词byti(意为“是”)可出现在异常丰富的语法构式集合中。我们通过分析语料库中构式分布的数据,评估该动词究竟是单一动词还是一对体对立动词。本研究突破了对byti行为的严格结构主义解读(structuralist interpretation),转而承认数据中存在的真实变异与歧义。我们的研究成果在理论与描述层面均取得了进展。
辐射范畴结构(radial category structure)是认知语言学的核心要义之一,但直至此前,这类结构通常由研究者基于其对数据的定性洞察而提出。我们证明,仅以语言数据作为输入,即可通过统计方法识别辐射范畴的节点与结构。我们提供了对byti的精细化描述,该描述可清晰区分核心用法与边缘用法,并展现了各类用法间的关联。尽管我们找到了一些支持体对立动词假说的证据,但绝大多数证据均指向单一动词的解读。
第二部分 数据与R脚本
数据集:本分析使用的数据集为frames.csv。
R脚本:本分析使用的R脚本为PartTwo.r。
创建时间:
2024-01-05



