MikeGreen2710/s2
收藏Hugging Face2024-04-19 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/MikeGreen2710/s2
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: index
dtype: string
- name: text
dtype: string
- name: updated_at
dtype: timestamp[ns]
- name: post_date
dtype: string
- name: address
dtype: string
- name: NUM
sequence: string
- name: LAN
sequence: string
- name: DIS
sequence: string
- name: LEG
sequence: string
- name: WAR
sequence: string
- name: STR
sequence: string
- name: CIT
sequence: string
- name: LOC
sequence: string
- name: RWD
sequence: string
- name: SHP
sequence: string
- name: LIV
sequence: string
- name: FDR
sequence: string
- name: PUR
sequence: string
- name: ARA
sequence: string
- name: FWD
sequence: string
- name: CAR
sequence: string
- name: STU
sequence: string
- name: COR
sequence: string
- name: NOF
sequence: string
- name: NOBA
sequence: string
- name: PRI
sequence: string
- name: YCT
sequence: string
- name: RPI
sequence: string
- name: NOBR
sequence: string
- name: STU_std
sequence: string
- name: SHP_std
sequence: string
- name: COR_std
sequence: string
- name: NOF_std
dtype: float64
- name: tum_san_thuong
dtype: int64
- name: ham
dtype: int64
- name: lung
dtype: int64
- name: tret
dtype: int64
- name: NOF_so_tang
dtype: float64
- name: NOF_so_lau
dtype: float64
- name: NOF_so_tang_noi
dtype: float64
splits:
- name: train
num_bytes: 950565110
num_examples: 800000
download_size: 432438885
dataset_size: 950565110
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
MikeGreen2710
原始信息汇总
数据集概述
数据集特征
- index:字符串类型
- text:字符串类型
- updated_at:时间戳类型,精度到纳秒
- post_date:字符串类型
- address:字符串类型
- NUM, LAN, DIS, LEG, WAR, STR, CIT, LOC, RWD, SHP, LIV, FDR, PUR, ARA, FWD, CAR, STU, COR, NOF, NOBA, PRI, YCT, RPI, NOBR, STU_std, SHP_std, COR_std:字符串序列类型
- NOF_std:浮点数类型
- tum_san_thuong, ham, lung, tret:整数类型
- NOF_so_tang, NOF_so_lau, NOF_so_tang_noi:浮点数类型
数据集分割
- train:训练集,包含800000个样本,数据量大小为950565110字节。
数据集大小
- 下载大小:432438885字节
- 数据集大小:950565110字节



