five

kotoba-speech/wiki40b_lines_ko

收藏
Hugging Face2025-12-10 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/kotoba-speech/wiki40b_lines_ko
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: shard_01 features: &id001 - name: text dtype: string - name: key dtype: string splits: - name: train num_bytes: 182583824 num_examples: 200000 download_size: 108573914 dataset_size: 182583824 - config_name: shard_02 features: - name: text dtype: string - name: key dtype: string splits: - name: train num_bytes: 181291540 num_examples: 200000 download_size: 107861806 dataset_size: 181291540 - config_name: shard_03 features: - name: text dtype: string - name: key dtype: string splits: - name: train num_bytes: 33694348 num_examples: 37224 download_size: 20120413 dataset_size: 33694348 - config_name: subset_400K features: *id001 splits: - name: train num_examples: 400000 configs: - config_name: shard_01 data_files: - split: train path: shard_01/train-* - config_name: shard_02 data_files: - split: train path: shard_02/train-* - config_name: shard_03 data_files: - split: train path: shard_03/train-* - config_name: subset_400K data_files: - split: train path: - shard_01/train-* - shard_02/train-* ---
提供机构:
kotoba-speech
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作