MikeGreen2710/data_pred_3_models_batch_3
Source: Hugging Face. Updated: 2024-04-18. Indexed: 2024-06-12.
Download link:
https://hf-mirror.com/datasets/MikeGreen2710/data_pred_3_models_batch_3
Resource description:
---
dataset_info:
  features:
  - name: level_0
    dtype: int64
  - name: index
    dtype: string
  - name: text
    dtype: string
  - name: updated_at
    dtype: timestamp[ns]
  - name: post_date
    dtype: string
  - name: address
    dtype: string
  - name: STR
    sequence: string
  - name: DIS
    sequence: string
  - name: LAN
    sequence: string
  - name: NUM
    sequence: string
  - name: LOC
    sequence: string
  - name: WAR
    sequence: string
  - name: LEG
    sequence: string
  - name: CIT
    sequence: string
  - name: FWD
    sequence: string
  - name: PUR
    sequence: string
  - name: ARA
    sequence: string
  - name: RWD
    sequence: string
  - name: CAR
    sequence: string
  - name: FDR
    sequence: string
  - name: SHP
    sequence: string
  - name: LIV
    sequence: string
  - name: NOBA
    sequence: string
  - name: NOBR
    sequence: string
  - name: COR
    sequence: string
  - name: YCT
    sequence: string
  - name: NOF
    sequence: string
  - name: STU
    sequence: string
  - name: RPI
    sequence: string
  - name: PRI
    sequence: string
  splits:
  - name: train
    num_bytes: 665495147
    num_examples: 600000
  download_size: 323158836
  dataset_size: 665495147
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
Provider:
MikeGreen2710
Summary of original information
Dataset overview
Dataset features
- level_0: data type int64.
- index: data type string.
- text: data type string.
- updated_at: data type timestamp[ns].
- post_date: data type string.
- address: data type string.
- STR, DIS, LAN, NUM, LOC, WAR, LEG, CIT, FWD, PUR, ARA, RWD, CAR, FDR, SHP, LIV, NOBA, NOBR, COR, YCT, NOF, STU, RPI, PRI: each is a sequence whose elements are of type string.
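The feature list above can be mirrored in plain Python to check column names and types before working with real rows. This is a minimal sketch using only the standard library. The names `SCALAR_COLUMNS`, `TAG_COLUMNS`, and `empty_row` are hypothetical helpers introduced here for illustration, not part of the dataset or of any library, and representing `updated_at` as a `datetime` is an assumption about how the timestamp is decoded.

```python
from datetime import datetime
from typing import Any, Dict, List

# Scalar columns and their declared dtypes, copied from the feature list.
SCALAR_COLUMNS: Dict[str, str] = {
    "level_0": "int64",
    "index": "string",
    "text": "string",
    "updated_at": "timestamp[ns]",
    "post_date": "string",
    "address": "string",
}

# Columns declared as sequences of strings.
TAG_COLUMNS: List[str] = [
    "STR", "DIS", "LAN", "NUM", "LOC", "WAR", "LEG", "CIT",
    "FWD", "PUR", "ARA", "RWD", "CAR", "FDR", "SHP", "LIV",
    "NOBA", "NOBR", "COR", "YCT", "NOF", "STU", "RPI", "PRI",
]


def empty_row() -> Dict[str, Any]:
    """Build a placeholder row with a plausible Python value per column.

    Hypothetical helper: the placeholder values are illustrative only.
    """
    row: Dict[str, Any] = {
        "level_0": 0,
        "index": "",
        "text": "",
        "updated_at": datetime(1970, 1, 1),  # assumed decoding of timestamp[ns]
        "post_date": "",
        "address": "",
    }
    # Each sequence column gets its own empty list of strings.
    for name in TAG_COLUMNS:
        row[name] = []
    return row
```

A row built this way has 30 keys: 6 scalar columns plus 24 sequence columns.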
Dataset splits
- train: contains 600000 examples, with a data size of 665495147 bytes.
Dataset size
- Download size: 323158836 bytes.
- Dataset size: 665495147 bytes.
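The figures above allow two quick sanity checks: the average size of one example and how the download size compares with the dataset size. The short calculation below is only a sketch. The constant names are chosen here for illustration, and reading the smaller download size as the effect of compressed storage is an assumption, since the listing does not state the file format.

```python
# Values copied from the split and size information above.
NUM_EXAMPLES = 600_000
DATASET_SIZE_BYTES = 665_495_147
DOWNLOAD_SIZE_BYTES = 323_158_836

# Average number of bytes per example in the train split.
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# Fraction of the dataset size that has to be downloaded.
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

print(f"average bytes per example: {avg_bytes_per_example:.1f}")
print(f"download size / dataset size: {download_ratio:.3f}")
```

This works out to roughly 1.1 KB per example, with a download a little under half the dataset size.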
Configuration
- config_name: default
- data_files:
  - split: train
    path: data/train-*
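Given this configuration, the train split could be loaded with the Hugging Face `datasets` library roughly as sketched below. This assumes the third-party `datasets` package is installed, that the repository is still publicly reachable, and that network access is available. `load_train` is a hypothetical wrapper name, and streaming is chosen here only to avoid downloading every file up front.

```python
# Identifiers taken from the listing above.
REPO_ID = "MikeGreen2710/data_pred_3_models_batch_3"
CONFIG_NAME = "default"
SPLIT = "train"


def load_train(streaming: bool = True):
    """Load the train split of the default config (hypothetical wrapper).

    Requires the third-party `datasets` package and network access.
    """
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(
        REPO_ID,
        name=CONFIG_NAME,
        split=SPLIT,
        streaming=streaming,
    )
```

With streaming enabled, `next(iter(load_train()))` should return a single row as a dictionary keyed by the column names listed earlier, assuming the repository is available.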



