
cbrownpinilla/paracrawl-subset

Source: Hugging Face. Updated 2026-01-28; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/cbrownpinilla/paracrawl-subset
Dataset description:
---
dataset_info:
- config_name: default
  features:
  - name: translation
    struct:
    - name: en
      dtype: string
    - name: zh
      dtype: string
  splits:
  - name: train
    num_bytes: 2606588373
    num_examples: 14170869
  download_size: 1945983557
  dataset_size: 2606588373
- config_name: en-de
  features:
  - name: translation
    struct:
    - name: de
      dtype: string
    - name: en
      dtype: string
  splits:
  - name: train
    num_bytes: 57503102445
    num_examples: 278316474
  download_size: 40777156005
  dataset_size: 57503102445
- config_name: en-es
  features:
  - name: translation
    struct:
    - name: en
      dtype: string
    - name: es
      dtype: string
  splits:
  - name: train
    num_bytes: 77431828813
    num_examples: 396509112
  download_size: 53542786836
  dataset_size: 77431828813
- config_name: en-it
  features:
  - name: translation
    struct:
    - name: en
      dtype: string
    - name: it
      dtype: string
  splits:
  - name: train
    num_bytes: 26207373297
    num_examples: 120122281
  download_size: 18319055982
  dataset_size: 26207373297
- config_name: en-zh
  features:
  - name: translation
    struct:
    - name: en
      dtype: string
    - name: zh
      dtype: string
  splits:
  - name: train
    num_bytes: 2606588373
    num_examples: 14170869
  download_size: 1945983557
  dataset_size: 2606588373
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: en-de
  data_files:
  - split: train
    path: en-de/train-*
- config_name: en-es
  data_files:
  - split: train
    path: en-es/train-*
- config_name: en-it
  data_files:
  - split: train
    path: en-it/train-*
- config_name: en-zh
  data_files:
  - split: train
    path: en-zh/train-*
---
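The metadata above gives per-config example counts and sizes, so it may help to see roughly how large each language pair is before downloading. The sketch below copies those figures into a dictionary and derives download size in GiB and average bytes per example. The helper names `summarize` and `stream_pairs` are hypothetical, and `stream_pairs` assumes the Hugging Face `datasets` library is installed and the hub is reachable; it is not called by default. The `default` config is left out of the table because its figures are identical to `en-zh`.

```python
# Figures copied from the dataset_info metadata (train split of each config).
CONFIGS = {
    "en-de": {"num_examples": 278_316_474, "dataset_size": 57_503_102_445,
              "download_size": 40_777_156_005},
    "en-es": {"num_examples": 396_509_112, "dataset_size": 77_431_828_813,
              "download_size": 53_542_786_836},
    "en-it": {"num_examples": 120_122_281, "dataset_size": 26_207_373_297,
              "download_size": 18_319_055_982},
    "en-zh": {"num_examples": 14_170_869, "dataset_size": 2_606_588_373,
              "download_size": 1_945_983_557},
}


def summarize(configs):
    """Return download size (GiB) and mean bytes per example for each config."""
    summary = {}
    for name, info in configs.items():
        summary[name] = {
            "download_gib": info["download_size"] / 2**30,
            "bytes_per_example": info["dataset_size"] / info["num_examples"],
        }
    return summary


def stream_pairs(config="en-zh", limit=3):
    """Yield the first `limit` translation dicts without a full download.

    Assumption: the `datasets` library is installed and the repository is
    reachable. Each row is expected to hold a `translation` struct keyed by
    language code, as described in the metadata.
    """
    from datasets import load_dataset  # imported lazily; optional dependency

    ds = load_dataset(
        "cbrownpinilla/paracrawl-subset", config, split="train", streaming=True
    )
    for i, row in enumerate(ds):
        if i >= limit:
            break
        yield row["translation"]


if __name__ == "__main__":
    for name, stats in summarize(CONFIGS).items():
        print(f"{name}: {stats['download_gib']:.1f} GiB download, "
              f"{stats['bytes_per_example']:.0f} bytes/example")
```

Streaming is used in `stream_pairs` because the larger configs run to tens of gigabytes; for the much smaller `en-zh` config a regular, non-streaming load may be practical.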
Provided by:
cbrownpinilla