
Das Kiezdeutschkorpus (KiDKo)

Source: DataCite Commons. Updated 2021-09-21. Indexed 2025-04-16.
Download link:
https://www.fdr.uni-hamburg.de/record/8246
Resource description:
CLARIN metadata summary for Das Kiezdeutschkorpus (KiDKo) (CMDI-based)

Title: Das Kiezdeutschkorpus (KiDKo)

Description: A multi-modal digital corpus of spontaneous discourse data from informal, oral peer group situations in multi- and monoethnic speech communities.

Description (German original): Multimodales, digitales Korpus spontansprachlicher Gesprächsdaten aus informellen, mündlichen Peer-Group-Situationen in multi- und monoethnischen Sprechergemeinschaften.

Publication date: 2016-11-21

Data owner: Heike Wiese

Contributors (all listed as compilers): Heike Wiese (heike.wiese@uni-potsdam.de), Oliver Bunk, Ulrike Freywald, Sophie Hamm, Banu Hueck, Anne Junghans, Jana Kiolbassa, Julia Kostka, Marlen Leisner, Nadine Lestmann, Katharina Mayr, Tiner Özçelik, Charlotte Pauli, Gergana Popova, Ines Rehbein, Nadja Reinold, Franziska Rohland, Sören Schalowski, Kathleen Schumann, Kristina Tjona Sommer, Emiel Visser

Project: B6: Analysis on the periphery, German Research Foundation (DFG)

Keywords: spoken language, urban youth language, Kiezdeutsch, Sprachliche Entwicklung im Gegenwartsdeutschen, informeller Sprachgebrauch, Jugendsprache im urbanen Raum

Language: German (deu)

Size: 23 speakers (8 female, 15 male), 270 communications, 270 recordings, 66 hours, 270 transcriptions, 333,000 words

Segmentation units: lexeme

Annotation types:
    non-verbal layer
    transcription (manual): literary transcription for spoken language/GAT2
    n: normalisation (automatic, dictionary lookup); orthographic normalisation of non-canonical pronunciations, punctuation and capitalisation to Standard German
    pos: automated part-of-speech tagging using an adapted STTS tagset for spoken language developed for KiDKo
    macro: marking of repairs
    tr: transcription of Turkish language material
    trnorm: normalisation to Standard Turkish
    trdtwwue: literal translation of Turkish to Standard German
    trdtue: free translation of Turkish to Standard German

Temporal coverage: 2008/2011

Spatial coverage: Berlin-Kreuzberg, DE

Genre: discourse

Modality: spoken
Provider:
Universität Hamburg
Created:
2020-11-10