five

The morphologically glossed Rigveda - The Zurich annotation corpus revised and extended. Hosted by VedaWeb - Online Research Platform for Old Indic Texts.

收藏
Mendeley Data2024-05-10 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/8410656
下载链接
链接失效反馈
官方服务:
资源简介:
This file contains morphological and lexicographic annotations for the Rigveda. It was created in the DFG-funded research project Vedaweb and used as source data for the linguistic research platform vedaweb.uni-koeln.de. Prof. Dr. Paul Widmer and Dr. Salvatore Scarlata from the "Institut für Vergleichende Sprachwissenschaft" (Universität Zürich) provided the VedaWeb project a Filemaker file that was later transformed in Cologne into an Excel file. This data contained a version of the Rigveda by Prof. Dr. A. Lubotsky ("Indo-European Linguistics", Leiden University) that had been morphosytactically annotated over the course of more than 10 years at the University of Zurich. It also contained for each token, if available, a reference to an entry in Grassmann's dictionary for the Rigveda. Modifications made by Jakob Halfmann and Natalie Korobzow to the data in 2020: Disambiguation of the relevant categories, if unspecified in Zurich data, according to the Grassmann dictionary (updates from 6th edition partially included up to page 274): case, gender and number for nouns, pronouns (columns G–I) number, person, mood, tense and voice for verbs (columns I–M) up to line 109216 case, gender, number, tense and voice for participles (columns G–I, L–M) up to line 109216 absolutives are marked as Abs. in columns N and V Inconsistencies between the original file from Zurich and the Grassmann dictionary as well as internal inconsistencies in Grassmann are noted in column AE, whenever they were noticed. Zurich data was overwritten by conflicting Grassmann data in columns G–M but retained elsewhere. Verb classes according to Whitney (1885) and Jamison (1983) for class 10 in column Y, differences in root spelling between Whitney and Grassmann are noted in column Z. All potential verb classes provided by Whitney are given for every occurrence of the root. Local particles and verbal forms containing them are marked as LP in column AF. Comparatives and superlatives are marked as such in column X and desideratives as Des. in column Y. Modifications made by Anna Fischer (data transformation, technical realisation) to the data: New structure of data table for linguistic annotations with new column titles: A - "VERS_NR": renamed column (from "belege::stelleMMSSSRR") B - "PADA_NR": renamed column (from "belege::pada") C - "PADA_TEXT_LUBOTSKY": renamed column (from "belege::lubotskypada") D - "TOKEN_NR_VERS": renamed column (from "belege::wortnummer rc") E - "TOKEN_NR_PADA": renamed column (from "belege::wortnummer pada") F - "FORM": renamed column (from "belege::form") G - "KASUS": renamed column (from "belege::kasus") H - "GENUS": renamed column (from "belege::genus") I - "NUMERUS": renamed column (from "belege::numerus") J - "PERSON": renamed column (from "belege::person") K - "TEMPUS": moved and renamed column (from L "belege::tempus") L - "PRAESENSKLASSE": created new column for present stem class for each form M - "LEMMA_PRAESENSKLASSEN": created column for present stem classes of respective lemma: Moved and renamed column (from Y "formen::zusätzliche merkmale verb"), moved values "Abs." and "Inf." to column P "INFINIT", moved values "Prek." and "si-Ipv." to column N "MOOD", moved value "Des." to column Q "ABGELEITETE_KONJUGATION": moved value "se-Form" to column W "WEITERE_WERTE" N - "MODUS": moved and renamed column (from K "belege::modus") O - "DIATHESE": moved and renamed column (from M "belege::diathese") P - "INFINIT": created new column for infinite forms "Abs.", "Inf.", "Ptz.", "ta-Ptz.", "na-Ptz." Q - "ABGELEITETE_KONJUGATION": created new column for secondary conjugation "Des.", "Int.", "Kaus." R - "GRADUS": created new column for degree: "Comp.", "Sup." S - "LOKALPARTIKEL": moved and renamed column (from AF "LP") T - "LEMMA_ZÜRICH": moved and renamed column (from AA "lemmata klassisch::lemma") U - "LEMMA_ZÜRICH_LEMMATYP": moved and renamed column (from AB "lemmata klassisch::lemmatyp") V - "LEMMA_ZÜRICH_BEDEUTUNG": moved and renamed column (from AC "lemmata klassisch::bedeutung") W - "WEITERE_WERTE": created new column for all miscellaneous values: e.g. "Hyperchar.", "n-haltig", "se-Form" X - "KOMMENTAR": created new column merging former columns Z "formen::HELPformbestimmung", AD "lemmata klassisch::HELPbedeutung" and AE "anmerkungen abweichungen" Columns that were removed due to redundant information: "formen::zusätzliche merkmale nomen": values "superlative" And "comparative" were renamed "sup." and "comp." and moved to new column R "GRADUS", all other values were moved to new column for miscellaneous W "WEITERE_WERTE" "belege::belegbestimmung summe simpel": values "Ptz.", "ta-Ptz." and "na-Ptz." were moved to new new column P "INFINIT" "belege::kasus bestof" "belege::genus bestof" "belege::numerus bestof" "belege::person bestof" "belege::modus bestof" "belege::tempus bestof" "belege::diathese bestof" "belege::belegbestimmung bestof summe sophistiziert" Revisions and additions made by Antje Casaretto to the data in 2023: F-T: - revision and correction (wherever necessary) of all annotations (books 1-7) G,H,I - disambiguation of case forms, reg. pronouns and nominal forms, if unspecified in Zurich data (books 1-7) L - disambiguation of present stem classes (book 7 and book 1 up to line 21050 vers 01.125.01) M - disambiguation of denominal verbs from primary verbs of the 10th class (books 1-10) N - disambiguation of precative and optative forms wherever possible (books 1-7) Q - new annotations for "Int." (intensives) and "Kaus." (causatives) (books 1-7)
创建时间:
2023-10-10
二维码
社区交流群
二维码
科研交流群
商业服务