fsteig/conversations-30gb
收藏Hugging Face2026-03-05 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/fsteig/conversations-30gb
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
dataset_info:
features:
- name: archived
dtype: string
- name: author
dtype: string
- name: author_fullname
dtype: string
- name: body
dtype: string
- name: comment_type
dtype: string
- name: controversiality
dtype: string
- name: created_utc
dtype: string
- name: edited
dtype: string
- name: gilded
dtype: string
- name: id
dtype: string
- name: link_id
dtype: string
- name: locked
dtype: string
- name: name
dtype: string
- name: parent_id
dtype: string
- name: permalink
dtype: string
- name: retrieved_on
dtype: string
- name: score
dtype: string
- name: subreddit_id
dtype: string
- name: subreddit_name_prefixed
dtype: string
- name: subreddit_type
dtype: string
- name: total_awards_received
dtype: string
- name: source
dtype: string
- name: cataloged_time
dtype: timestamp[s]
- name: channel_id
dtype: string
- name: description
dtype: string
- name: duration
dtype: int64
- name: metadata
struct:
- name: license
dtype: string
- name: provenance
dtype: string
- name: url
dtype: string
- name: published_time
dtype: timestamp[s]
- name: tags
list: string
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 62637388803
num_examples: 85988253
download_size: 31081332075
dataset_size: 62637388803
---
common-pile/youtube_filtered combined with a random sample of 85M items from HuggingFaceGECLM/REDDIT_comments
提供机构:
fsteig



