potsawee/emilia-mm-pretrain-fix
收藏Hugging Face2025-12-06 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/potsawee/emilia-mm-pretrain-fix
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
# ALL DATA (default view)
- config_name: "all"
default: true
data_files:
- split: "train"
path: "*/*/*.parquet"
# Language configs (complete list)
- config_name: "EN"
data_files:
- split: "train"
path: "Emilia/EN/*.parquet"
- config_name: "DE"
data_files:
- split: "train"
path: "Emilia/DE/*.parquet"
- config_name: "FR"
data_files:
- split: "train"
path: "Emilia/FR/*.parquet"
- config_name: "ZH"
data_files:
- split: "train"
path: "Emilia/ZH/*.parquet"
- config_name: "JA"
data_files:
- split: "train"
path: "Emilia/JA/*.parquet"
- config_name: "KO"
data_files:
- split: "train"
path: "Emilia/KO/*.parquet"
- config_name: "EN-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/EN/*.parquet"
- config_name: "DE-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/DE/*.parquet"
- config_name: "FR-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/FR/*.parquet"
- config_name: "ZH-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/ZH/*.parquet"
- config_name: "JA-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/JA/*.parquet"
- config_name: "KO-YODAS"
data_files:
- split: "train"
path: "Emilia-YODAS/KO/*.parquet"
---
# Emilia-Mimi Pretraining (Fix) Dataset
Fix the leading whitespace issue in https://huggingface.co/datasets/potsawee/emilia-mm-pretrain
提供机构:
potsawee



