five

Enggano Flora and Fauna Lexicon

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14553650
下载链接
链接失效反馈
官方服务:
资源简介:
Overview This repository (Wijaya et al., 2024) holds the annotated databases for the Enggano Flora and Fauna Lexicon. The databases, originally stored as Google Spreadsheets for collaboration, are then accessed and processed using R codes in this repository using several R packages (Bryan, 2023; Cysouw, 2018; D'Agostino McGowan & Bryan, 2023; Moran & Cysouw, 2018; Ooms, 2023; Wickham et al., 2019; Wickham & Bryan, 2023). The processing includes creating orthography profile and tokenisation/segmentation of the phonemic transcription, and, most importantly, creating links between the Enggano forms and their corresponding pictures (ID) to be used in the Contemporary Enggano Dictionary, which is processed using R here. The databases consist of four different file types: .rds (R data file), .csv, .tsv, and .xlsx. Dendi Wijaya gathered the primary data in October 2023 and November 2024; transcribed the forms; translated them into Indonesian and English; provided the IPA transcription; and rename the photos according to the ID of the forms. Gede Primahadi W. Rajeg (GPWR) checked which items have been in the contemporary Enggano FLEx databases and which one to exclude from the main dictionary databases (e.g., due to duplication, etc.), in consultation with Engga Zakaria Sangian. GPWR also manually annotated the main entry variable of the forms so that complex forms can be subsumed under/linked to their main/root entry in the dictionary; annotated the crossref. column; performed the segmentation of the IPA transcription (and fixed errors); linked the forms ID with the photo by filename; manage this GitHub repository for archiving. Computational steps involved are documented in this repository. Engga Zakaria Sangian was consulted in a number of meetings for the verification of orthography and inclusion of the forms.
创建时间:
2024-12-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作