HeyJay!: A Corpus of Atypical Speech for Spoken Language Understanding and Automatic Speech Recognition, United States, 2023-2024
收藏DataCite Commons2026-04-08 更新2026-05-03 收录
下载链接:
https://www.icpsr.umich.edu/web/ICPSR/studies/39448/versions/V4
下载链接
链接失效反馈官方服务:
资源简介:
HeyJay! is a restricted-access study consisting of speech audio files and associated metadata, including file-level annotations and participant-level information. HeyJay! is a new corpus of atypical speech from participants with neurodegenerative disorders, including Parkinson's Disease, Ataxias, or Amyotrophic Lateral Sclerosis.
The current corpus version contains more than 8,500 utterance recordings encompassing supervised transcriptions and intent annotations. Additionally, it includes speech quality ratings for each participant, performed by three expert speech and language pathologists. This corpus, the first one with intent annotation of atypical speech that is publicly available, is intended to create more fair speech technologies for atypical speakers by adapting and improving the state of the art and to enable further research in the field.
提供机构:
ICPSR - Interuniversity Consortium for Political and Social Research
创建时间:
2026-04-08



