Lëtzebuerger Online Dictionnaire (LOD) - Linguistesch Daten
收藏data.public.lu2024-06-14 更新2025-03-26 收录
下载链接:
https://data.public.lu/fr/datasets/letzebuerger-online-dictionnaire-lod-linguistesch-daten/
下载链接
链接失效反馈官方服务:
资源简介:
Complete dataset available via the API of the Lëtzebuerger Online Dictionnaire (LOD, https://lod.lu). Contains all the data on which the site lod.lu is based, with the exception of inflection tables. The latter can be downloaded separately. The file on which the search feature is based can be found here. Important: When using the LOD data, please note the following attributes (Element attribute): EGS ("ëmgangssproochlech" – colloquial) FAM ("graff" – crude) GEHUEW ("gehuewen" – formal) KANNERSPROOCH ("Kannersprooch" – child language) NEOL ("Neologismus" – neologism) PEJ ("pejorativ" – pejorative) VEREELZT ("vereelzt" – archaic) VULG ("vulgär" – vulgar) These attributes indicate the linguistic register and specify, for example, if a certain term is considered outdated or pejorative/derogatory/offensive. The infobox contents with the label important (infobox/@label="important") indicate that a word requires special consideration. The audio files are available at https://lod.lu/uploads/AAC/ (in .m4a format) or https://lod.lu/uploads/OGG/ (in .ogg format). The file number of all articles corresponds to their respective ID (entry/@id) in lowercase letters. All recorded example sentences have an ID (example/@id). The audio files can be downloaded via https://lod.lu/uploads/examples/AAC/ (.m4a format) or https://lod.lu/uploads/examples/OGG/ (.ogg format). Please note that these folders are further divided into sub-folders. Each sub-folder name corresponds to the first two characters of its respective ID. For example, the sentences "fiert dëse Bus op Altwis?" has the ID ffe70832cd99be656a5177023a680a7b. The audio file can be found at https://lod.lu/uploads/examples/AAC/ff/ffe70832cd99be656a5177023a680a7b.m4a or at https://lod.lu/uploads/examples/OGG/ff/ffe70832cd99be656a5177023a680a7b.ogg. Kompletten Datesaz, mat deem d'API vum Lëtzebuerger Online Dictionnaire (LOD, https://lod.lu) alimentéiert gëtt. Enthält alleguerten d'Donnéeën, op deenen de Site lod.lu berout, mat Ausnam vun de Flexiounstabellen. Déi kann een sech hei separat eroflueden. De Fichier, op deem d'Sich berout, fënnt een hei. Beuecht wgl. beim Benotze vun den LOD-Daten onbedéngt follgend Attributer (Element attribute): EGS ("ëmgangssproochlech") FAM ("graff") GEHUEW ("gehuewen") KANNERSPROOCH ("Kannersprooch") NEOL ("Neologismus") PEJ ("pejorativ") VEREELZT ("vereelzt") VULG ("vulgär") Dës Attributer beschreiwen de Sproochregëster a weisen z. B. op e vereelzten oder op en ofwäertende Gebrauch hin. D'Infobox-Inhalter mam Label important (infobox/@label="important") informéieren driwwer, wann ee bei Wierder soll besonnesch Uecht dinn. D'Audiodateie stinn ënner https://lod.lu/uploads/AAC/ (am .m4a-Format), respektiv https://lod.lu/uploads/OGG/ (am .ogg-Format). De Fichiersnumm entsprécht fir all Artikel der ID (entry/@id) a klenge Buschtawen. Fir den Artikel Kaweechelchen z. B. https://lod.lu/uploads/AAC/kaweechelchen2.m4a, resp. https://lod.lu/uploads/OGG/kaweechelchen2.ogg. D'Beispiller, fir déi et eng Audio-Opnam gëtt, hunn eng ID (example/@id). Et kann een d'Audiodateien eroflueden ënner https://lod.lu/uploads/examples/AAC/ (am .m4a-Format), respektiv https://lod.lu/uploads/examples/OGG/ (am .ogg-Format). Déi Dossiere sinn nach weider ënnerdeelt: Den Ënnerdossiers-Numm entsprécht deenen zwee éischten Zeeche vun der ID. Z. B. huet de Saz „fiert dëse Bus op Altwis?“ d'ID ffe70832cd99be656a5177023a680a7b. D'Audiodatei steet ënner https://lod.lu/uploads/examples/AAC/ff/ffe70832cd99be656a5177023a680a7b.m4a, resp. https://lod.lu/uploads/examples/OGG/ff/ffe70832cd99be656a5177023a680a7b.ogg.
本数据集可通过卢森堡在线词典(Lëtzebuerger Online Dictionnaire,简称 LOD,网址:https://lod.lu)的API获取。该数据集包含了构建lod.lu网站所依托的全部数据,但不含屈折变化表。屈折变化表可单独下载。基于搜索功能的文件可在此处找到。重要提示:在使用LOD数据时,请注意以下属性(元素属性):EGS(“ëmgangssproochlech”——口语)FAM(“graff”——粗俗)GEHUEW(“gehuewen”——正式)KANNERSPROOCH(“Kannersprooch”——儿童语言)NEOL(“Neologismus”——新词)PEJ(“pejorativ”——贬义)VEREELZT(“vereelzt”——古旧)VULG(“vulgär”——俚语)。这些属性描述了语言的语域,例如,指示某个术语是否被认为是过时或贬义/侮辱性的。标签为“important”(infobox/@label="important")的方框内容表明该单词需要特别注意。音频文件可在https://lod.lu/uploads/AAC/(.m4a格式)或https://lod.lu/uploads/OGG/(.ogg格式)找到。所有文章的文件编号均对应其各自的ID(entry/@id),且为小写字母。所有记录的例句均具有ID(example/@id)。音频文件可通过https://lod.lu/uploads/examples/AAC/(.m4a格式)或https://lod.lu/uploads/examples/OGG/(.ogg格式)下载。请注意,这些文件夹进一步分为子文件夹。每个子文件夹的名称对应其ID的前两个字符。例如,例句“fiert dëse Bus op Altwis?”的ID为ffee70832cd99be656a5177023a680a7b。相应的音频文件位于https://lod.lu/uploads/examples/AAC/ff/ffe70832cd99be656a5177023a680a7b.m4a或https://lod.lu/uploads/examples/OGG/ff/ffe70832cd99be656a5177023a680a7b.ogg。
提供机构:
data.public.lu



