JackyHoCL/Cantonese_Dataset
收藏Hugging Face2025-10-23 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/JackyHoCL/Cantonese_Dataset
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含音频和文本数据的粤语数据集,基于kennychan-6/Cantonese_Dataset的转录版本,使用了JackyHoCL/whisper-large-v3-turbo-cantonese-noise-detection模型进行转录。数据集包含一个训练集,共有1900个示例,总大小约为582,368,856.4字节。
This is a Cantonese dataset consisting of audio and text data, based on the transcribed version of kennychan-6/Cantonese_Dataset, transcribed using the JackyHoCL/whisper-large-v3-turbo-cantonese-noise-detection model. The dataset includes a training set with a total of 1900 examples, totaling approximately 582,368,856.4 bytes in size.
提供机构:
JackyHoCL



