SinclairSchneider/tweets_with_suspicious_links
Source: Hugging Face | Updated: 2026-03-01 | Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/SinclairSchneider/tweets_with_suspicious_links
Resource description:
---
dataset_info:
  features:
  - name: search_term
    dtype: string
  - name: id
    dtype: int64
  - name: link
    dtype: string
  - name: text
    dtype: string
  - name: name
    dtype: string
  - name: username
    dtype: string
  - name: profile_id
    dtype: string
  - name: avatar
    dtype: string
  - name: date
    dtype: timestamp[ns, tz=UTC]
  - name: is-retweet
    dtype: bool
  - name: is-pinned
    dtype: bool
  - name: external-link
    dtype: string
  - name: replying-to
    sequence: string
  - name: quoted-post
    dtype: string
  - name: comments
    dtype: int64
  - name: retweets
    dtype: int64
  - name: quotes
    dtype: int64
  - name: likes
    dtype: int64
  - name: pictures
    sequence: string
  - name: videos
    sequence: string
  - name: gifs
    sequence: string
  - name: lang
    dtype: string
  splits:
  - name: train
    num_bytes: 109664049
    num_examples: 217439
  download_size: 36648829
  dataset_size: 109664049
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
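The schema above can be exercised with a short loading sketch. This is a minimal example, assuming the Hugging Face `datasets` library is installed and the repository is publicly reachable under the ID shown in the title. The helper names `load_train`, `total_engagement`, and `original_posts` are illustrative and not part of the dataset. Several column names contain hyphens (`is-retweet`, `external-link`, `replying-to`), so rows are read with bracket access.

```python
from typing import Any, Iterable, Iterator, Mapping

# Repository ID taken from the page title.
REPO_ID = "SinclairSchneider/tweets_with_suspicious_links"


def load_train(streaming: bool = True):
    """Load the `train` split (assumes `pip install datasets` and network access)."""
    # Imported lazily so the pure-Python helpers below work without the library.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train", streaming=streaming)


def total_engagement(row: Mapping[str, Any]) -> int:
    """Sum the four int64 counters from the schema, treating missing values as 0."""
    counters = ("comments", "retweets", "quotes", "likes")
    return sum(int(row.get(key) or 0) for key in counters)


def original_posts(rows: Iterable[Mapping[str, Any]]) -> Iterator[Mapping[str, Any]]:
    """Yield rows that are not retweets, using the hyphenated `is-retweet` column."""
    for row in rows:
        if not row["is-retweet"]:
            yield row


# Example (requires network access):
#   for row in original_posts(load_train()):
#       print(row["username"], total_engagement(row))
#       break
```

Streaming is used by default here so that a first look at the data does not require downloading the full split; pass `streaming=False` to materialise it locally instead.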
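Dataset information:
Feature fields:
- Field name: search term (search_term); data type: string
- Field name: identifier (id); data type: 64-bit integer (int64)
- Field name: link (link); data type: string
- Field name: text content (text); data type: string
- Field name: name (name); data type: string
- Field name: username (username); data type: string
- Field name: profile identifier (profile_id); data type: string
- Field name: avatar link (avatar); data type: string
- Field name: date (date); data type: nanosecond-precision timestamp in UTC (timestamp[ns, tz=UTC])
- Field name: is a retweet (is-retweet); data type: Boolean (bool)
- Field name: is a pinned post (is-pinned); data type: Boolean (bool)
- Field name: external link (external-link); data type: string
- Field name: reply targets (replying-to); data type: sequence of strings
- Field name: quoted post content (quoted-post); data type: string
- Field name: comment count (comments); data type: 64-bit integer (int64)
- Field name: retweet count (retweets); data type: 64-bit integer (int64)
- Field name: quote count (quotes); data type: 64-bit integer (int64)
- Field name: like count (likes); data type: 64-bit integer (int64)
- Field name: picture links (pictures); data type: sequence of strings
- Field name: video links (videos); data type: sequence of strings
- Field name: GIF links (gifs); data type: sequence of strings
- Field name: language (lang); data type: string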
Splits:
- Split name: train; size in bytes: 109664049; number of examples: 217439
Download size: 36648829 bytes
Total dataset size: 109664049 bytes
Configs:
- Config name: default; data files:
  - Split: train; file path: data/train-*
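The size figures above imply a few derived quantities. The following is a rough sketch that uses only the numbers stated in the card; the function name `size_summary` is illustrative. It assumes that `download_size` is the size of the files as downloaded and `dataset_size` is the size of the data once loaded, which is how these fields are normally used on Hugging Face but is not confirmed by this page.

```python
# Figures copied from the dataset card.
NUM_BYTES = 109_664_049      # dataset_size, equal to num_bytes of the train split
DOWNLOAD_BYTES = 36_648_829  # download_size
NUM_EXAMPLES = 217_439       # num_examples in the train split


def size_summary(num_bytes: int, download_bytes: int, num_examples: int) -> dict:
    """Return average bytes per example, the download/dataset ratio, and sizes in MiB."""
    return {
        "avg_bytes_per_example": num_bytes / num_examples,
        "download_ratio": download_bytes / num_bytes,
        "dataset_mib": num_bytes / 2**20,
        "download_mib": download_bytes / 2**20,
    }


summary = size_summary(NUM_BYTES, DOWNLOAD_BYTES, NUM_EXAMPLES)
for key, value in summary.items():
    print(f"{key}: {value:.2f}")
```

With the stated numbers this works out to about 504 bytes per example, with the download being roughly one third the size of the loaded dataset.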
Provider:
SinclairSchneider



