Enggano Flora and Fauna Lexicon
Indexed by NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/14553650
Resource description:
Overview
This repository (Wijaya et al., 2024) holds the annotated databases for the Enggano Flora and Fauna Lexicon. The databases, originally stored as Google Spreadsheets for collaboration, are accessed and processed with the R code in this repository, which relies on several R packages (Bryan, 2023; Cysouw, 2018; D'Agostino McGowan & Bryan, 2023; Moran & Cysouw, 2018; Ooms, 2023; Wickham et al., 2019; Wickham & Bryan, 2023).
The processing includes creating an orthography profile, tokenising/segmenting the phonemic transcription, and, most importantly, linking the Enggano forms to their corresponding pictures (by ID) for use in the Contemporary Enggano Dictionary, which is also processed using R.
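To make the orthography-profile step concrete, here is a minimal sketch of profile-based segmentation using greedy longest-match. It is written in Python purely for illustration; the repository itself does this in R, and this is not its code. The grapheme inventory, the sample string, and the `?` marker for unmatched characters are invented placeholders, not Enggano data.

```python
def build_profile(graphemes):
    """Return the grapheme inventory sorted longest first.

    Sorting by length lets multi-character graphemes (for example a vowel
    plus a length mark) win over their single-character prefixes.
    Empty strings are dropped so the tokeniser always advances.
    """
    return sorted({g for g in graphemes if g}, key=len, reverse=True)


def tokenise(form, profile, unknown="?"):
    """Segment `form` into space-separated graphemes from `profile`.

    Characters not covered by the profile are replaced by `unknown`
    (a hypothetical marker chosen for this sketch) so that gaps in the
    profile are easy to spot and fix.
    """
    segments = []
    i = 0
    while i < len(form):
        for grapheme in profile:
            if form.startswith(grapheme, i):
                segments.append(grapheme)
                i += len(grapheme)
                break
        else:
            # No grapheme matched at this position: flag it and move on.
            segments.append(unknown)
            i += 1
    return " ".join(segments)


# Hypothetical inventory and input, for illustration only.
profile = build_profile(["a", "aː", "k", "h", "ʔ", "i"])
print(tokenise("kaːhaʔi", profile))
```

Flagging unmatched characters instead of silently dropping them is a design choice that helps when a profile is built iteratively: each flagged position points to either a transcription error or a grapheme still missing from the profile.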
The databases are provided in four file formats: .rds (R data file), .csv, .tsv, and .xlsx.
Dendi Wijaya gathered the primary data in October 2023 and November 2024, transcribed the forms, translated them into Indonesian and English, provided the IPA transcription, and renamed the photos according to the IDs of the forms.
Gede Primahadi W. Rajeg (GPWR) checked which items were already in the contemporary Enggano FLEx databases and which ones to exclude from the main dictionary databases (e.g., due to duplication), in consultation with Engga Zakaria Sangian. GPWR also manually annotated the main entry variable of the forms so that complex forms can be subsumed under, and linked to, their main/root entry in the dictionary; annotated the crossref. column; performed the segmentation of the IPA transcription (and fixed errors); linked the form IDs with the photos by filename; and manages this GitHub repository for archiving. The computational steps involved are documented in this repository.
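Two of the annotation steps described above, linking form IDs to photos by filename and subsuming complex forms under a main entry, can be sketched as follows. This is a Python illustration under stated assumptions, not the repository's R code: it assumes each photo is named `<ID>.<extension>`, and the IDs, the placeholder forms, and the `form` and `main_entry` field names are all hypothetical.

```python
from collections import defaultdict
from pathlib import PurePath


def link_photos(form_ids, photo_filenames):
    """Map each form ID to the photo whose filename stem equals that ID.

    Assumes photos are named '<ID>.<extension>' (an assumption of this
    sketch). Forms without a matching photo map to None.
    """
    by_stem = {PurePath(name).stem: name for name in photo_filenames}
    return {form_id: by_stem.get(form_id) for form_id in form_ids}


def group_by_main_entry(rows):
    """Group forms under their main entry.

    Each row is a dict with hypothetical keys 'form' and 'main_entry'.
    A row with an empty 'main_entry' is treated as its own main entry.
    """
    groups = defaultdict(list)
    for row in rows:
        key = row["main_entry"] or row["form"]
        groups[key].append(row["form"])
    return dict(groups)


# Placeholder data, for illustration only.
links = link_photos(["FF001", "FF002"], ["FF001.jpg", "FF003.jpg"])
print(links)

rows = [
    {"form": "root1", "main_entry": ""},
    {"form": "root1-x", "main_entry": "root1"},
    {"form": "root2", "main_entry": ""},
]
print(group_by_main_entry(rows))
```

Returning `None` for forms without a photo, rather than omitting them, keeps the result aligned with the full list of forms so that missing pictures can be audited.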
Engga Zakaria Sangian was consulted in a number of meetings for the verification of orthography and inclusion of the forms.
Created:
2024-12-25



