NLP-UniBW/tweets_dataset_jan_feb_big_deduplicated
Source: Hugging Face · Updated: 2025-09-18 · Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/NLP-UniBW/tweets_dataset_jan_feb_big_deduplicated
Description:
---
dataset_info:
  features:
  - name: search_term
    dtype: string
  - name: id
    dtype: int64
  - name: link
    dtype: string
  - name: text
    dtype: string
  - name: name
    dtype: string
  - name: username
    dtype: string
  - name: profile_id
    dtype: string
  - name: avatar
    dtype: string
  - name: date
    dtype: timestamp[ns, tz=UTC]
  - name: is-retweet
    dtype: bool
  - name: is-pinned
    dtype: bool
  - name: external-link
    dtype: string
  - name: replying-to
    sequence: string
  - name: quoted-post
    struct:
    - name: date
      dtype: string
    - name: gifs
      sequence: string
    - name: link
      dtype: string
    - name: pictures
      sequence: string
    - name: text
      dtype: string
    - name: user
      struct:
      - name: avatar
        dtype: string
      - name: name
        dtype: string
      - name: profile_id
        dtype: string
      - name: username
        dtype: string
    - name: videos
      sequence: 'null'
  - name: comments
    dtype: int64
  - name: retweets
    dtype: int64
  - name: quotes
    dtype: int64
  - name: likes
    dtype: int64
  - name: pictures
    sequence: string
  - name: videos
    sequence: 'null'
  - name: gifs
    sequence: string
  - name: __index_level_0__
    dtype: int64
  splits:
  - name: train
    num_bytes: 18391874444
    num_examples: 30692248
  download_size: 10702777054
  dataset_size: 18391874444
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
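The size figures in the metadata above are easier to read after some quick arithmetic. The sketch below copies `num_bytes`, `num_examples`, and `download_size` from the card. The helper name `size_summary` is a hypothetical name introduced for illustration, not part of any library. It works out to roughly 600 bytes per row, with the download about 58% of the decoded size.

```python
# Figures copied verbatim from the `splits` metadata in the dataset card.
NUM_BYTES = 18_391_874_444      # num_bytes / dataset_size of the train split
NUM_EXAMPLES = 30_692_248       # num_examples of the train split
DOWNLOAD_SIZE = 10_702_777_054  # download_size (compressed files on the Hub)


def size_summary(num_bytes: int, num_examples: int, download_size: int) -> dict:
    """Derive human-readable size figures (hypothetical helper, for illustration)."""
    return {
        "dataset_gb": num_bytes / 1e9,            # decimal gigabytes
        "download_gb": download_size / 1e9,
        "avg_bytes_per_example": num_bytes / num_examples,
        "download_ratio": download_size / num_bytes,
    }


summary = size_summary(NUM_BYTES, NUM_EXAMPLES, DOWNLOAD_SIZE)
for key, value in summary.items():
    print(f"{key}: {value:.2f}")
```

At this scale it is worth checking free disk space before a full download, or streaming the split instead.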
Provider:
NLP-UniBW
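Because the train split holds about 30.7 million rows, streaming is usually more practical than a full download. The following is a minimal sketch, assuming the third-party `datasets` library is installed and the repository is publicly readable. `stream_tweets` and `original_posts` are hypothetical helper names; the field names (`is-retweet`, `likes`, `id`, `username`, `text`) come from the feature list in the card.

```python
from typing import Any, Dict, Iterable, Iterator

DATASET_ID = "NLP-UniBW/tweets_dataset_jan_feb_big_deduplicated"


def stream_tweets(dataset_id: str = DATASET_ID) -> Iterable[Dict[str, Any]]:
    """Stream the train split lazily instead of downloading it up front."""
    # Imported here so the filter below stays usable without the library.
    from datasets import load_dataset  # third-party: pip install datasets

    return load_dataset(dataset_id, split="train", streaming=True)


def original_posts(
    rows: Iterable[Dict[str, Any]], min_likes: int = 0
) -> Iterator[Dict[str, Any]]:
    """Yield non-retweet rows with at least `min_likes` likes (hypothetical filter)."""
    for row in rows:
        likes = row.get("likes") or 0  # treat a missing/null count as zero
        if not row.get("is-retweet") and likes >= min_likes:
            yield {
                "id": row["id"],
                "username": row["username"],
                "text": row["text"],
                "likes": likes,
            }


# Example usage (requires network access):
# from itertools import islice
# for post in islice(original_posts(stream_tweets(), min_likes=100), 5):
#     print(post["username"], post["likes"], post["text"][:80])
```

Keeping the filter as a plain generator over dictionaries means it can be checked on a few hand-written rows before pointing it at the full stream.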



