five

Texts in “A Grammar of Bulu Puroik”

收藏
Research Data Australia2024-12-14 收录
下载链接:
https://researchdata.edu.au/texts-a-grammar-bulu-puroik/968065
下载链接
链接失效反馈
官方服务:
资源简介:
Annotated audio and video recordings of different genres containing all text examples cited in the PhD dissertation “A Grammar of Bulu Puroik”. The texts included in the grammar are segments of the recordings here. Some recordings contain more than one text. For example, the recording VISITKR contains the texts SULPH, LANG and WOOD. Besides the media files, every item contains annotations as plain org-mode (.org) and XML (.eaf), and as a pdf. The data in the .org, .eaf and .pdf-files is equivalent. *** org *** The .org files are best viewed in GNU emacs with org-mode. For playing the sound files org-player.el is required. However, being plain text, the org-files can be read with any text editor. Videos are currently not played in emacs. *** eaf *** The .eaf files can be viewed with ELAN. They contain following tiers (for each speaker): ref – with a unique reference id composed of the text label and the time (e.g. SAGO00:12), disslabel – the reference id as used in the grammar (often identical to ref), tx – transcription, word – words in phonological orthography, morph – morphemes, gl – glosses of morphemes, morph_id – id of morpheme such as used in in the Bulu Puroik lexicon, ft – free translation, com – comment *** pdf *** Unlike in the other data formats, in the pdf-files every morpheme is linked to a glossary. The pdfs are best viewed with a pdf-viewer which shows a snippet of the link target (e.g. Skim).

带标注的多类型音视频录制素材,涵盖了博士学位论文《布鲁普罗伊克语语法》("A Grammar of Bulu Puroik")中引用的全部文本示例。该语法书收录的文本均源自本次录制素材的片段。部分录制素材可包含多段文本,例如素材VISITKR即收录了SULPH、LANG与WOOD三段文本。除音视频媒体文件外,每个数据条目均附带三类标注文件:纯文本org编辑模式(org-mode)格式(.org)、可扩展标记语言(XML)格式(.eaf)与PDF格式,且.org、.eaf及.pdf三类文件的数据内容完全一致。 *** org *** .org格式文件最适合在搭载org编辑模式(org-mode)的GNU Emacs中打开。若要播放其中的音频文件,还需安装org-player.el插件。因其本质为纯文本文件,.org文件也可通过任意文本编辑器读取。不过当前GNU Emacs暂不支持播放本数据集内的视频文件。 *** eaf *** .eaf格式文件可通过ELAN工具打开查看。此类文件包含针对每位说话人的以下标注层:ref——由文本标签与时间戳组合而成的唯一参考标识(例如SAGO00:12);disslabel——语法书中使用的参考标识(通常与ref完全一致);tx——转写文本;word——采用语音正写法书写的词汇;morph——语素切分结果;gl——语素释义;morph_id——布鲁普罗伊克语词典中使用的语素编号;ft——自由译文;com——注释内容 *** pdf *** 与其他格式的数据不同,.pdf文件中的每个语素均关联至对应的词汇表。这类PDF文件最适合使用可显示链接目标预览片段的PDF阅读器打开(例如Skim)。
提供机构:
PARADISEC
二维码
社区交流群
二维码
科研交流群
商业服务