Disinformation on YouTube: A dataset of YouTube comments on videos related to claims made by Trump and Vance on Haitian immigrants
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14166027
下载链接
链接失效反馈官方服务:
资源简介:
The corpus contains three files. First, the youtube_haitian_disinformation_videos_meta.csv file includes comments and YouTube video metadata. Data is organized around per video information. The columnar values are:
video_id
date (video publication date)
title
description
channel_title
transcript
transcript_str (Video transcript without timestamps)
views
likes
comments (all comments per video)
Second, the youtube_haitian_disinformation_comment_reply_metadata.csv file includes comments, replies, and comment metadata. Each comment occupies its own row in the spreadsheet. The columnar field are:
video_id
comment
comment_date
comment_like_count
author
comment_id
in_reply_to
neg, neu, pos, compound (VADER polarity scores)
named_entities (spaCy named entity tags with tokens)
emoji (spaCy Emojis and token spans)
Comments and associated metadata are represented in individual rows.
The third file contains the results of the TFIDF analysis described herein. The TFIDF analysis features the top 5000 terms weights for the comments to each video in a .csv file.
youtube_disinfo_comments_tfidf_results_per_video.csv
创建时间:
2024-12-30



