ytcheng/sm_strategy
Hugging Face · Updated 2024-05-15 · Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/ytcheng/sm_strategy
Resource description:
---
dataset_info:
  features:
  - name: title
    dtype: string
  - name: time
    dtype: int64
  - name: url
    dtype: string
  - name: sort
    dtype: int64
  - name: type
    dtype: int64
  - name: content
    dtype: string
  - name: modify_time
    dtype: int64
  - name: create_time
    dtype: int64
  - name: tag
    dtype: string
  - name: from_unixtime(create_time)
    dtype: string
  - name: id
    dtype: int64
  - name: content_text
    dtype: string
  - name: clean_text
    dtype: string
  - name: augmented_contents
    sequence: string
  splits:
  - name: train
    num_bytes: 4455261
    num_examples: 228
  download_size: 2124585
  dataset_size: 4455261
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
The dataset has 14 features: title, time, url, sort, type, content, modify_time, create_time, tag, from_unixtime(create_time), id, content_text, clean_text, and augmented_contents. It has a single train split of 228 examples totaling 4455261 bytes. The only configuration is named default, and its data files match the path data/train-*.
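As a rough illustration of the schema described above, the following sketch lists the declared features and checks a row against them. The `load_train` helper assumes the Hugging Face `datasets` package is installed and the repository is reachable. The names `EXPECTED_FEATURES` and `check_row`, and the choice to accept `None` as a missing value, are assumptions introduced here rather than anything the dataset card specifies.

```python
from typing import Any

# Column names and declared dtypes, copied from the dataset card above.
EXPECTED_FEATURES: dict[str, str] = {
    "title": "string",
    "time": "int64",
    "url": "string",
    "sort": "int64",
    "type": "int64",
    "content": "string",
    "modify_time": "int64",
    "create_time": "int64",
    "tag": "string",
    "from_unixtime(create_time)": "string",
    "id": "int64",
    "content_text": "string",
    "clean_text": "string",
    "augmented_contents": "sequence[string]",
}


def _matches(value: Any, dtype: str) -> bool:
    """Check one value against a declared dtype; None counts as a missing value (assumption)."""
    if value is None:
        return True
    if dtype == "string":
        return isinstance(value, str)
    if dtype == "int64":
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, int) and not isinstance(value, bool)
    if dtype == "sequence[string]":
        return isinstance(value, (list, tuple)) and all(isinstance(v, str) for v in value)
    return False


def check_row(row: dict[str, Any]) -> list[str]:
    """Return a list of problems for one row; an empty list means it fits the schema."""
    problems = [f"missing column: {name}" for name in EXPECTED_FEATURES if name not in row]
    problems += [
        f"bad type for {name}: expected {dtype}"
        for name, dtype in EXPECTED_FEATURES.items()
        if name in row and not _matches(row[name], dtype)
    ]
    return problems


def load_train():
    """Load the train split; needs `pip install datasets` and network access."""
    from datasets import load_dataset  # imported lazily so the checks above run without it

    return load_dataset("ytcheng/sm_strategy", split="train")
```

After `ds = load_train()`, calling `check_row(ds[0])` should return an empty list if the first row matches the card.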
Provider:
ytcheng
Summary of original information
Dataset overview
Dataset features
- title: string
- time: integer (int64)
- url: string
- sort: integer (int64)
- type: integer (int64)
- content: string
- modify_time: integer (int64)
- create_time: integer (int64)
- tag: string
- from_unixtime(create_time): string
- id: integer (int64)
- content_text: string
- clean_text: string
- augmented_contents: sequence of strings
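The column name `from_unixtime(create_time)` suggests that it holds a formatted version of `create_time`. A minimal sketch of that conversion follows, assuming `create_time` is a Unix timestamp in seconds and that the string uses a `YYYY-MM-DD HH:MM:SS` layout. The source states neither the format nor the time zone, so UTC and the helper name `from_unixtime` are assumptions.

```python
from datetime import datetime, timezone


def from_unixtime(seconds: int) -> str:
    """Format a Unix timestamp (seconds) as 'YYYY-MM-DD HH:MM:SS' in UTC."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc).strftime("%Y-%m-%d %H:%M:%S")


# Synthetic value for illustration, not a row from the dataset.
print(from_unixtime(1715731200))  # 2024-05-15 00:00:00
```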
Dataset splits
- train:
  - Number of examples: 228
  - Data size: 4455261 bytes
Dataset size
- Download size: 2124585 bytes
- Total dataset size: 4455261 bytes
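The size figures above allow a quick sanity check. The sketch below derives the average example size and the ratio of download size to dataset size from the reported numbers only. The variable names are illustrative, and the smaller download is presumably due to compressed storage, which the source does not confirm.

```python
# Figures copied from the dataset card.
NUM_EXAMPLES = 228
DATASET_SIZE = 4_455_261   # bytes
DOWNLOAD_SIZE = 2_124_585  # bytes

avg_bytes_per_example = DATASET_SIZE / NUM_EXAMPLES
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(f"average example size: {avg_bytes_per_example:.0f} bytes")
print(f"download / dataset size: {download_ratio:.1%}")
```

This works out to roughly 19.5 KB per example, with the download a little under half the total dataset size.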
Configuration
- default:
  - Training data path: data/train-*
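The training data path is a glob-style pattern rather than a single file name. The sketch below uses Python's `fnmatch` as a stand-in to show which names such a pattern would select. The file names are hypothetical, and the source does not describe how the hosting platform itself resolves the pattern.

```python
from fnmatch import fnmatchcase

PATTERN = "data/train-*"  # from the dataset card

# Hypothetical file names, used only to show how the pattern behaves.
candidates = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "train-00000-of-00001.parquet",
]

# Note: fnmatch's "*" also matches "/", unlike shell path globbing.
matched = [name for name in candidates if fnmatchcase(name, PATTERN)]
print(matched)
```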



