Second DIHARD Challenge Development - SEEDLingS
收藏DataCite Commons2025-06-30 更新2026-05-06 收录
下载链接:
https://datasets.lib.berkeley.edu/citation?persistentId=doi:10.60503/D3/486ED5
下载链接
链接失效反馈官方服务:
资源简介:
Second DIHARD Challenge Development - SEEDLinGS was developed by Duke University and LDC and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Second DIHARD Challenge. This release, when combined with Second DIHARD Challenge Development - Eleven Sources (LDC2021S10), contains the development set audio data and annotation, except for CHiME-5 audio files, which must be obtained from the University of Sheffield. The DIHARD Challenges are a set of shared tasks on diarization focusing on "hard" diarization; that is, speech diarization for challenging corpora where there was an expectation that existing state-of-the-art systems would fare poorly. As with the first challenge, the second development and evaluation sets were drawn from a diverse sampling of sources including monologues, map task dialogues, broadcast interviews, sociolinguistic interviews, meeting speech, speech in restaurants, clinical recordings, extended child language acquisition recordings, and YouTube videos.
提供机构:
UC Berkeley Library Dataverse
创建时间:
2025-06-30



