
BroDeadlines/TEST.TDT.mini.tdt_copora_data

Source: Hugging Face. Updated 2024-09-12; indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/BroDeadlines/TEST.TDT.mini.tdt_copora_data
Resource description:
---
dataset_info:
- config_name: compact
  features:
  - name: url
    dtype: string
  - name: content
    dtype: string
  - name: metadata
    dtype: string
  - name: doc_id
    dtype: string
  - name: split
    sequence: string
  - name: shards
    dtype: int64
  - name: proposition_str
    sequence: string
  - name: proposition_list
    sequence: string
  splits:
  - name: train
    num_bytes: 8205155.707182321
    num_examples: 541
  download_size: 2782924
  dataset_size: 8205155.707182321
- config_name: compact_diemchuan
  features:
  - name: url
    dtype: string
  - name: content
    dtype: string
  - name: metadata
    dtype: string
  - name: doc_id
    dtype: string
  - name: split
    sequence: string
  - name: shards
    dtype: int64
  - name: proposition_list
    sequence: string
  splits:
  - name: train
    num_bytes: 6491370
    num_examples: 548
  download_size: 2122758
  dataset_size: 6491370
- config_name: default
  features:
  - name: url
    dtype: string
  - name: content
    dtype: string
  - name: doc_id
    dtype: string
  - name: metadata
    dtype: string
  splits:
  - name: train
    num_bytes: 3154318.1803069054
    num_examples: 781
  download_size: 1121137
  dataset_size: 3154318.1803069054
- config_name: diem_chuan
  features:
  - name: url
    dtype: string
  - name: content
    dtype: string
  - name: shards
    dtype: int64
  - name: split
    sequence: string
  - name: metadata
    dtype: string
  - name: doc_id
    dtype: string
  - name: proposition_list
    sequence: string
  splits:
  - name: train
    num_bytes: 106322
    num_examples: 7
  download_size: 41773
  dataset_size: 106322
configs:
- config_name: compact
  data_files:
  - split: train
    path: compact/train-*
- config_name: compact_diemchuan
  data_files:
  - split: train
    path: compact_diemchuan/train-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: diem_chuan
  data_files:
  - split: train
    path: diem_chuan/train-*
---
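The card metadata lists four configs, each with a single `train` split. The following is a minimal sketch of how one might select a config and load it with the Hugging Face `datasets` library. The `CONFIGS` table is copied by hand from the metadata, and the helper names `shared_features` and `load_config` are hypothetical, introduced only for illustration. It assumes the repository is still publicly reachable on the Hugging Face Hub under the ID shown; this has not been verified here.

```python
from typing import Dict, List

# Repository ID as it appears in the listing.
REPO_ID = "BroDeadlines/TEST.TDT.mini.tdt_copora_data"

# Train-split row counts and column names, copied from the card metadata.
CONFIGS: Dict[str, dict] = {
    "compact": {
        "num_examples": 541,
        "features": ["url", "content", "metadata", "doc_id", "split",
                     "shards", "proposition_str", "proposition_list"],
    },
    "compact_diemchuan": {
        "num_examples": 548,
        "features": ["url", "content", "metadata", "doc_id", "split",
                     "shards", "proposition_list"],
    },
    "default": {
        "num_examples": 781,
        "features": ["url", "content", "doc_id", "metadata"],
    },
    "diem_chuan": {
        "num_examples": 7,
        "features": ["url", "content", "shards", "split", "metadata",
                     "doc_id", "proposition_list"],
    },
}


def shared_features() -> List[str]:
    """Return the columns present in every config, in the order of `default`."""
    common = set.intersection(*(set(c["features"]) for c in CONFIGS.values()))
    return [name for name in CONFIGS["default"]["features"] if name in common]


def load_config(name: str = "default"):
    """Load the train split of one config (hypothetical helper).

    Needs network access and `pip install datasets`; the import is deferred
    so the rest of this sketch runs without the library installed.
    """
    if name not in CONFIGS:
        raise ValueError(f"unknown config {name!r}; expected one of {sorted(CONFIGS)}")
    from datasets import load_dataset
    return load_dataset(REPO_ID, name, split="train")
```

Calling `load_config("diem_chuan")` would fetch the smallest config (7 rows per the metadata), which is a cheap way to inspect the schema before downloading the larger ones.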
Provider:
BroDeadlines