derek-thomas/processed-subset-bestofredditorupdates
Source: Hugging Face. Updated: 2023-12-27. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/derek-thomas/processed-subset-bestofredditorupdates
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
dataset_info:
  features:
  - name: id
    dtype: string
  - name: content
    dtype: string
  - name: score
    dtype: int64
  - name: date_utc
    dtype: timestamp[ns]
  - name: title
    dtype: string
  - name: flair
    dtype: string
  - name: poster
    dtype: string
  - name: permalink
    dtype: string
  - name: updated
    dtype: bool
  - name: new
    dtype: bool
  - name: embedding
    sequence: float64
  splits:
  - name: train
    num_bytes: 128012062
    num_examples: 10355
  download_size: 95501729
  dataset_size: 128012062
---
# Dataset Card for "processed-subset-bestofredditorupdates"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
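
The card itself gives no usage instructions, so the following is only a minimal sketch of how the single `train` split might be loaded and checked against the declared schema. It assumes the Hugging Face `datasets` library is installed and that the repository is reachable. The helper names `missing_columns` and `load_train_split` are hypothetical and not part of the dataset or the library.

```python
from typing import Iterable

# Repository id as shown in the page title.
REPO_ID = "derek-thomas/processed-subset-bestofredditorupdates"

# Column names and dtypes copied from the card's `dataset_info.features`.
EXPECTED_FEATURES = {
    "id": "string",
    "content": "string",
    "score": "int64",
    "date_utc": "timestamp[ns]",
    "title": "string",
    "flair": "string",
    "poster": "string",
    "permalink": "string",
    "updated": "bool",
    "new": "bool",
    "embedding": "sequence of float64",
}


def missing_columns(column_names: Iterable[str]) -> list[str]:
    """Return the expected columns that are absent from `column_names`."""
    present = set(column_names)
    return [name for name in EXPECTED_FEATURES if name not in present]


def load_train_split():
    """Download the `train` split (requires network access and `datasets`)."""
    # Imported lazily so that `missing_columns` stays usable offline.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")
```

Under these assumptions, `ds = load_train_split()` followed by `missing_columns(ds.column_names)` should return an empty list if the hosted data still matches the card.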
Provider:
derek-thomas
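Summary of original information
Dataset overview
Dataset configuration
- Config name: default
- Data files:
  - Split: train
  - Path: data/train-*
Dataset information
- Features:
  - id: string
  - content: string
  - score: 64-bit integer
  - date_utc: timestamp (nanoseconds)
  - title: string
  - flair: string
  - poster: string
  - permalink: string
  - updated: boolean
  - new: boolean
  - embedding: sequence of floats
- Splits:
  - Name: train
  - Bytes: 128012062
  - Examples: 10355
- Download size: 95501729
- Dataset size: 128012062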
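
The byte counts above are easier to read after a little arithmetic. The sketch below converts them to MiB and derives an average size per example. It assumes the usual Hugging Face convention that `dataset_size` is the size of the prepared dataset and `download_size` is the size of the files fetched; the function name `size_summary` is hypothetical.

```python
# Figures copied from the card metadata.
NUM_BYTES = 128_012_062      # dataset_size / num_bytes of the train split
NUM_EXAMPLES = 10_355        # num_examples of the train split
DOWNLOAD_SIZE = 95_501_729   # download_size

MIB = 2 ** 20  # bytes per mebibyte


def size_summary(num_bytes: int, num_examples: int, download_size: int) -> dict:
    """Derive human-readable size figures from the raw byte counts."""
    return {
        "dataset_mib": num_bytes / MIB,
        "download_mib": download_size / MIB,
        "bytes_per_example": num_bytes / num_examples,
        "download_ratio": download_size / num_bytes,
    }


summary = size_summary(NUM_BYTES, NUM_EXAMPLES, DOWNLOAD_SIZE)
for key, value in summary.items():
    print(f"{key}: {value:.2f}")
```

This works out to roughly 122 MiB for the prepared dataset, about 91 MiB to download, and on the order of 12 KB per example, much of which is presumably the `float64` embedding column.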



