five

The Hamburg MapTask Corpus (HAMATAC)

收藏
DataCite Commons2020-11-06 更新2025-04-16 收录
下载链接:
https://www.fdr.uni-hamburg.de/record/1480
下载链接
链接失效反馈
官方服务:
资源简介:
Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available. The Hamburg MapTask Corpus (HAMATAC) is a spoken language corpus documenting the performance of 24 L2 learners of German in a map task. HAMATAC was recorded and transcribed in project Z2 at the Research Centre on Multilingualism. The current version 0.3 contains a new communication with video recording as well as the resources known from the previous version, e.g. orthographic transcriptions of the recordings, manual annotation of disfluencies and automatic annotation of part-of-speech and lemmas. <strong>CLARIN Metadata summary for The Hamburg MapTask Corpus (HAMATAC) (CMDI-based)</strong> <strong>Title: </strong>The Hamburg MapTask Corpus (HAMATAC)<br> <strong>Description: </strong>Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available.<br> <strong>Publication date: </strong>2010-09-16<br> <strong>Data owner: </strong> Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de<br> <strong>Contributors: </strong> Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de (compiler)<br> <strong>Project: </strong> Z2 "Computer Assisted Methods for the creation and analysis of multilingual data", German Research Foundation (DFG)<br> <strong>Keywords: </strong> adult L2 acquisition, learner corpus, task-oriented communication, successive bilingualism, L2 data, adult bilingualism, simultaneous bilingualism, map task, EXMARaLDA<br> <strong>Language: </strong> German (deu)<br> <strong>Size: </strong> 28 speakers (16 female, 12 male), 26 communications, 26 recordings, 208 minutes, 26 transcriptions, 22898 words<br> <strong>Annotation types: </strong> transcription (manual): orthographic transcription/simplified HIAT, pos: Fine-grained part of speech tagging using TreeTagger and the STTS tagset., pos-sup: superordinate part of Speech (manual, STTS tagset), c: indicates that the automatic pos-annotation is incorrect, lemma: lemma (TreeTagger), disfluency: manual annotation of disfluency phenomena, pho: manual annotation of phonetic phenomena<br> <strong>Temporal Coverage: </strong> 2009-10-28/2013-06-19<br> <strong>Spatial Coverage: </strong> Hamburg, DE<br> <strong>Genre: </strong> discourse<br> <strong>Modality: </strong> spoken
提供机构:
Universität Hamburg
创建时间:
2020-09-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作