MikeGreen2710/s1
收藏Hugging Face2024-04-19 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/MikeGreen2710/s1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: index
dtype: string
- name: text
dtype: string
- name: updated_at
dtype: timestamp[ns]
- name: post_date
dtype: string
- name: address
dtype: string
- name: NUM
sequence: string
- name: LAN
sequence: string
- name: DIS
sequence: string
- name: LEG
sequence: string
- name: WAR
sequence: string
- name: STR
sequence: string
- name: CIT
sequence: string
- name: LOC
sequence: string
- name: RWD
sequence: string
- name: SHP
sequence: string
- name: LIV
sequence: string
- name: FDR
sequence: string
- name: PUR
sequence: string
- name: ARA
sequence: string
- name: FWD
sequence: string
- name: CAR
sequence: string
- name: STU
sequence: string
- name: COR
sequence: string
- name: NOF
sequence: string
- name: NOBA
sequence: string
- name: PRI
sequence: string
- name: YCT
sequence: string
- name: RPI
sequence: string
- name: NOBR
sequence: string
- name: STU_std
sequence: string
- name: SHP_std
sequence: string
- name: COR_std
sequence: string
- name: NOF_std
dtype: float64
- name: tum_san_thuong
dtype: int64
- name: ham
dtype: int64
- name: lung
dtype: int64
- name: tret
dtype: int64
- name: NOF_so_tang
dtype: float64
- name: NOF_so_lau
dtype: float64
- name: NOF_so_tang_noi
dtype: float64
splits:
- name: train
num_bytes: 950620514
num_examples: 800000
download_size: 432341759
dataset_size: 950620514
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
MikeGreen2710
原始信息汇总
数据集概述
数据集特征
- index: 字符串类型
- text: 字符串类型
- updated_at: 时间戳类型,精确到纳秒
- post_date: 字符串类型
- address: 字符串类型
- NUM: 字符串序列类型
- LAN: 字符串序列类型
- DIS: 字符串序列类型
- LEG: 字符串序列类型
- WAR: 字符串序列类型
- STR: 字符串序列类型
- CIT: 字符串序列类型
- LOC: 字符串序列类型
- RWD: 字符串序列类型
- SHP: 字符串序列类型
- LIV: 字符串序列类型
- FDR: 字符串序列类型
- PUR: 字符串序列类型
- ARA: 字符串序列类型
- FWD: 字符串序列类型
- CAR: 字符串序列类型
- STU: 字符串序列类型
- COR: 字符串序列类型
- NOF: 字符串序列类型
- NOBA: 字符串序列类型
- PRI: 字符串序列类型
- YCT: 字符串序列类型
- RPI: 字符串序列类型
- NOBR: 字符串序列类型
- STU_std: 字符串序列类型
- SHP_std: 字符串序列类型
- COR_std: 字符串序列类型
- NOF_std: 浮点数类型
- tum_san_thuong: 整数类型
- ham: 整数类型
- lung: 整数类型
- tret: 整数类型
- NOF_so_tang: 浮点数类型
- NOF_so_lau: 浮点数类型
- NOF_so_tang_noi: 浮点数类型
数据集分割
- train: 训练集
- 数据量: 950620514 字节
- 样本数: 800000
数据集大小
- 下载大小: 432341759 字节
- 数据集大小: 950620514 字节



