Malaysian-Emilia-v2
收藏魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/mesolitica/Malaysian-Emilia-v2
下载链接
链接失效反馈官方服务:
资源简介:
# Malaysian Emilia v2
This version 2 should fixed https://github.com/open-mmlab/Amphion/issues/436, an Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Malaysian and Singaporean Speech Generation. Replicating [Emilia](https://github.com/open-mmlab/Amphion/blob/main/preprocessors/Emilia) on,
## Dataset
### Clone and Extract
We upload as split zip files so you can clone and extract distributedly,
```bash
huggingface-cli download --repo-type dataset \
--include '*.zip' \
--local-dir './' \
--max-workers 20 \
mesolitica/Malaysian-Emilia-v2
wget https://gist.githubusercontent.com/huseinzol05/2e26de4f3b29d99e993b349864ab6c10/raw/9b2251f3ff958770215d70c8d82d311f82791b78/unzip.py
python3 unzip.py
```
## Source code
All source code at https://github.com/mesolitica/Emilia
## Licensing
```
All the videos, songs, images, and graphics used in the video belong to their respective owners and I does not claim any right over them.
Copyright Disclaimer under section 107 of the Copyright Act of 1976, allowance is made for "fair use" for purposes such as criticism, comment, news reporting, teaching, scholarship, education and research. Fair use is a use permitted by copyright statute that might otherwise be infringing.
```
# 马来西亚艾米莉亚v2(Malaysian Emilia v2)
本v2版本修复了https://github.com/open-mmlab/Amphion/issues/436 中提及的问题,是一款面向大规模马来西亚与新加坡语音生成任务的、覆盖全面、多语言且高多样性的语音数据集。本数据集复刻了[艾米莉亚(Emilia)](https://github.com/open-mmlab/Amphion/blob/main/preprocessors/Emilia)的相关设计。
## 数据集
### 克隆与解压
本数据集以分卷压缩包形式上传,支持分布式克隆与解压:
bash
huggingface-cli download --repo-type dataset
--include '*.zip'
--local-dir './'
--max-workers 20
mesolitica/Malaysian-Emilia-v2
wget https://gist.githubusercontent.com/huseinzol05/2e26de4f3b29d99e993b349864ab6c10/raw/9b2251f3ff958770215d70c8d82d311f82791b78/unzip.py
python3 unzip.py
## 源代码
所有源代码可访问地址:https://github.com/mesolitica/Emilia
## 授权许可
本视频中使用的全部视频、音乐、图像及图形素材均归其各自权利人所有,本人不对上述素材主张任何权益。
依据1976年《美国版权法案》第107条,允许出于批评、评论、新闻报道、教学、学术研究及教育目的进行“合理使用”。合理使用是指在版权法规允许范围内的使用行为,此类使用原本可能构成侵权。
提供机构:
maas
创建时间:
2025-10-12



