
Datenbank Gesprochenes Deutsch

Source: re3data.org, indexed 2024-05-31
Download link:
https://www.re3data.org/repository/r3d100000004
Resource description:
The "Database for Spoken German (DGD)" is a corpus management system in the program area Oral Corpora of the Institute for German Language (IDS). It has been online since early 2012 and, since mid-2014, has replaced the earlier spoken German database that was developed in the "Deutsches Spracharchiv (DSAv)" of the IDS. After a one-time registration, the DGD offers external users web-based access to selected parts of the collection of the "Archive for Spoken German (AGD)" for use in research and teaching. Which data are released for external use depends on the consent of the respective data provider, who in turn must hold the appropriate usage and exploitation rights. Certain protection needs of the archive also affect the selection.

The Archive for Spoken German (AGD) collects and archives data of spoken German in interactions (conversation corpora) and data of domestic and non-domestic varieties of German (variation corpora). Currently, the AGD hosts around 50 corpora comprising more than 15,000 audio and 500 video recordings, amounting to around 5,000 hours of recorded material with more than 7,000 transcripts. With the Research and Teaching Corpus of Spoken German (FOLK), the AGD is also compiling an extensive German conversation corpus of its own.

Access to the data of the Datenbank Gesprochenes Deutsch (DGD) is also provided by the IDS Repository: https://www.re3data.org/repository/r3d100010382

Provider:
Database for Spoken German
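Background and challenges
Background overview
Datenbank Gesprochenes Deutsch is a corpus management system focused on spoken German. It contains a large number of audio and video recordings and is intended mainly for research and teaching. Access to the dataset requires registration and may require payment.
The summary above was collected and generated by 遇见数据集.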