five

SNuC: The Sheffield Numbers Spoken Language Corpus

收藏
DataCite Commons2024-03-13 更新2024-07-13 收录
下载链接:
https://orda.shef.ac.uk/articles/dataset/SNuC_The_Sheffield_Numbers_Spoken_Language_Corpus/19673772
下载链接
链接失效反馈
官方服务:
资源简介:
SNuC is the first published corpus of spoken alphanumeric identifiers of the sort typically used as serial and part numbers in the manufacturing sector. The dataset contains recordings and transcriptions of over 50 native British English speakers, speaking over 13,000 multi-character alphanumeric sequences and totalling almost 20 hours of recorded speech. <br> Ethical approval to use human participants to gather spoken data using the setup described above was sought and obtained via the University of Sheffield's Research Ethics Review procedures (application 031449).  <br> Please refer to the following paper for more information about this dataset:  Barker, E., Barker, J., Gaizauskas, R., Ma, N., Paramita, M. L. 2022. SNuC: The Sheffield Numbers Spoken Language Corpus. In: Proceedings of LREC 2022 (forthcoming).
提供机构:
The University of Sheffield
创建时间:
2022-04-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作