BroDeadlines/TEST.TDT.mini.tdt_copora_data
收藏Hugging Face2024-09-12 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/BroDeadlines/TEST.TDT.mini.tdt_copora_data
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: compact
features:
- name: url
dtype: string
- name: content
dtype: string
- name: metadata
dtype: string
- name: doc_id
dtype: string
- name: split
sequence: string
- name: shards
dtype: int64
- name: proposition_str
sequence: string
- name: proposition_list
sequence: string
splits:
- name: train
num_bytes: 8205155.707182321
num_examples: 541
download_size: 2782924
dataset_size: 8205155.707182321
- config_name: compact_diemchuan
features:
- name: url
dtype: string
- name: content
dtype: string
- name: metadata
dtype: string
- name: doc_id
dtype: string
- name: split
sequence: string
- name: shards
dtype: int64
- name: proposition_list
sequence: string
splits:
- name: train
num_bytes: 6491370
num_examples: 548
download_size: 2122758
dataset_size: 6491370
- config_name: default
features:
- name: url
dtype: string
- name: content
dtype: string
- name: doc_id
dtype: string
- name: metadata
dtype: string
splits:
- name: train
num_bytes: 3154318.1803069054
num_examples: 781
download_size: 1121137
dataset_size: 3154318.1803069054
- config_name: diem_chuan
features:
- name: url
dtype: string
- name: content
dtype: string
- name: shards
dtype: int64
- name: split
sequence: string
- name: metadata
dtype: string
- name: doc_id
dtype: string
- name: proposition_list
sequence: string
splits:
- name: train
num_bytes: 106322
num_examples: 7
download_size: 41773
dataset_size: 106322
configs:
- config_name: compact
data_files:
- split: train
path: compact/train-*
- config_name: compact_diemchuan
data_files:
- split: train
path: compact_diemchuan/train-*
- config_name: default
data_files:
- split: train
path: data/train-*
- config_name: diem_chuan
data_files:
- split: train
path: diem_chuan/train-*
---
提供机构:
BroDeadlines



