
TunSwitch

ModelScope community. Updated: 2025-12-05. Indexed: 2025-06-07.
Download link:
https://modelscope.cn/datasets/MBZUAI/TunSwitch
Description:
This repo contains **diacritized transcription** and wavs of the TunSwitch dataset.

#### Acknowledgment

This work builds on the existing TunSwitch dataset by providing diacritics for the original transcriptions.

#### Citation

If you use the diacritized transcripts, please cite these works:

```
@misc{talafha2025nadi2025multidialectalarabic,
      title={NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task},
      author={Bashar Talafha and Hawau Olamide Toyin and Peter Sullivan and AbdelRahim Elmadany and Abdurrahman Juma and Amirbek Djanibekov and Chiyu Zhang and Hamad Alshehhi and Hanan Aldarmaki and Mustafa Jarrar and Nizar Habash and Muhammad Abdul-Mageed},
      year={2025},
      eprint={2509.02038},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2509.02038},
}

@misc{abdallah2023leveragingdatacollectionunsupervised,
      title={Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition},
      author={Ahmed Amine Ben Abdallah and Ata Kabboudi and Amir Kanoun and Salah Zaiem},
      year={2023},
      eprint={2309.11327},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2309.11327},
}
```
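Because the transcripts here add diacritics to the original TunSwitch text, it can be useful to map a diacritized string back to an undiacritized form, for example to compare against the original transcriptions. The snippet below is a minimal sketch of one way to do that, not part of the dataset's tooling: the function name `strip_diacritics` and the sample word are illustrative assumptions, and it assumes the diacritics are encoded as Unicode combining marks (category `Mn`), as is standard for Arabic harakat.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Drop Unicode nonspacing marks (category Mn), which include the
    Arabic harakat (fatha, damma, kasra, tanwin, shadda, sukun).

    The text is deliberately NOT normalized to NFD first, so precomposed
    letters such as alef with madda keep their form.
    """
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")


# Illustrative sample (not taken from the dataset): the word "marhaban"
# written with diacritics, spelled out as code points for clarity.
diacritized = (
    "\u0645\u064e"  # meem + fatha
    "\u0631\u0652"  # reh + sukun
    "\u062d\u064e"  # hah + fatha
    "\u0628\u064b"  # beh + fathatan
    "\u0627"        # alef
)

plain = strip_diacritics(diacritized)
print(len(diacritized), len(plain))
```

Latin-script words in code-switched utterances pass through unchanged as long as they use precomposed accented characters, since those are letters rather than combining marks.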

Provider:
maas
Created:
2025-05-29