Data from 'Dative absolutes in discourse: the value of deeply versus strategically annotated treebanks'
收藏DataCite Commons2022-06-27 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Data_from_Dative_absolutes_in_discourse_the_value_of_deeply_versus_strategically_annotated_treebanks_/12894035/1
下载链接
链接失效反馈官方服务:
资源简介:
This project contains all the datasets used in the paper 'Early Slavic dative absolutes in discourse: the value of deeply versus strategically annotated treebanks'.<br><br>- 'egda_raw.csv' contains all egda-clauses in the Codex Marianus. The only part which has been manipulated is where two subjects were coordinated by *i* 'and'. In these cases, an extra row was created, allowing both subjects to appear in the ocs_sub_lemma column. The row containing the second subject was left empty under all but the subject lemma variable. This allows to observe frequencies regarding lexical variation among egda-clauses' subjects, but at the same time to discard those rows when dealing with other variables.<br>- 'egda_manipulated.csv' considers all bystъ-clauses as pre-matrix.<br>- 'DA_Marianus_raw.csv' contains all dative absolutes in the Codex Marianus, as well as genitive absolutes for which there is an OCS parallel. It lists as separate entries both multiple dative participles with one dative subjects, and multiple dative subjects with one dative participle.<br>E.g.:<br>1) бꙑвъши же печали и гонению словесе ради абье съблажнѣатъ сѧ2) и въшедъши дъштери еѩ иродиѣдѣ. i плѧсавъши и оугождъши иродови<br>Both 1) and 2) are listed as multiple entries, although only 2) has technically more than one dative absolute.DA_Marianus_abridged.csv: This is the same as DA_Marianus_raw.csv, but lists as one dative absolute instances with multiple dative subjects and one dative participle. The criterion chosen was to only retain the entry for the subject which was the closest to the participle (the choice can make a difference should one want to consider the properties of a dative absolute with respect to its subjects).<br><br>- 'DA_Marianus_manipulated.csv' (starting from DA_Marianus_abridged.csv) treats all dative absolutes in bystъ-clauses as pre-matrix.<br>- 'DA_nogr_raw.csv' contains all the dative absolutes in the second case study (early Slavic texts without Greek parallels)<br>- 'DA_nogr_harm.csv' contains the same dative absolutes as DA_nogr_raw.csv but with harmonized Church Slavonic and Old East Slavic spellings.<br>- harmonize.py: script used to harmonize the Church Slavonic and Old East Slavic spellings in the paper's second case study.<br>The reader interested in reproducing the results of the paper should refer to the 'manipulated' versions of both the egda-clause and the dative absolute datasets.<br>
提供机构:
figshare
创建时间:
2020-08-29



