The Hamburg MapTask Corpus (HAMATAC)

Name: The Hamburg MapTask Corpus (HAMATAC)
Creator: Universität Hamburg
Published: 2020-11-06 12:35:35
License: 暂无描述

DataCite Commons2020-11-06 更新2025-04-16 收录

下载链接：

https://www.fdr.uni-hamburg.de/record/1480

下载链接

链接失效反馈

官方服务：

资源简介：

Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available. The Hamburg MapTask Corpus (HAMATAC) is a spoken language corpus documenting the performance of 24 L2 learners of German in a map task. HAMATAC was recorded and transcribed in project Z2 at the Research Centre on Multilingualism. The current version 0.3 contains a new communication with video recording as well as the resources known from the previous version, e.g. orthographic transcriptions of the recordings, manual annotation of disfluencies and automatic annotation of part-of-speech and lemmas. CLARIN Metadata summary for The Hamburg MapTask Corpus (HAMATAC) (CMDI-based) Title: The Hamburg MapTask Corpus (HAMATAC) Description: Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available. Publication date: 2010-09-16 Data owner: Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de Contributors: Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de (compiler) Project: Z2 "Computer Assisted Methods for the creation and analysis of multilingual data", German Research Foundation (DFG) Keywords: adult L2 acquisition, learner corpus, task-oriented communication, successive bilingualism, L2 data, adult bilingualism, simultaneous bilingualism, map task, EXMARaLDA Language: German (deu) Size: 28 speakers (16 female, 12 male), 26 communications, 26 recordings, 208 minutes, 26 transcriptions, 22898 words Annotation types: transcription (manual): orthographic transcription/simplified HIAT, pos: Fine-grained part of speech tagging using TreeTagger and the STTS tagset., pos-sup: superordinate part of Speech (manual, STTS tagset), c: indicates that the automatic pos-annotation is incorrect, lemma: lemma (TreeTagger), disfluency: manual annotation of disfluency phenomena, pho: manual annotation of phonetic phenomena Temporal Coverage: 2009-10-28/2013-06-19 Spatial Coverage: Hamburg, DE Genre: discourse Modality: spoken

提供机构：

Universität Hamburg

创建时间：

2020-09-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集