URajinda/myanmar_spoken_corpus
收藏Hugging Face2026-01-12 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/URajinda/myanmar_spoken_corpus
下载链接
链接失效反馈官方服务:
资源简介:
## Credits and Acknowledgments
This dataset is built upon the foundational work of the **Myanmar Spoken Corpus** by **freococo** (Wynn).
* **Original Dataset Source:** [freococo/myanmar_spoken_corpus](https://huggingface.co/datasets/freococo/myanmar_spoken_corpus)
* **Modifications:** * Curated and filtered for specific training needs of the ShweYon model.
* Integrated with 3% English subset for bilingual proficiency maintenance.
* Re-formatted into 36 shards for optimized Continued Pre-training (CPT).
We are deeply grateful to **freococo** for their contribution to the Myanmar AI community by providing this high-quality spoken corpus.
提供机构:
URajinda



