TopStonks, Full Set: Social Buzz, Sentiment, and Raw Data from the Most Popular Stock AND Crypto Forums
收藏Datarade2024-04-19 收录
下载链接:
https://datarade.ai/data-products/topstonks-social-buzz-and-sentiment-data-from-the-most-popul-topstonks
下载链接
链接失效反馈官方服务:
资源简介:
Because we have the largest, most complete data set from the notorious r/wallstreetbets (and 4Chan's /Biz) back to 2019, our data has been regularly featured in the WSJ in stories in the WSJ, Forbes, and other major publications. The latest iteration of our product includes: -API for institutional clients -Crypto -Sentiment analysis: deep-learning-trained using advanced ensemble models Full text and metadata of every post and comment, structured and searchable: -User: u/user on Reddit (with upvotes) -Time: date and time to the second -Text: full body of comment/post -Ticker: stock ticker -Mentions: each time ticker is referenced -Comments: each comment in which ticker is referenced -Posts: posts referencing ticker (highest level) DATA: 10 GB of comment data STORAGE: Housed in a PostgreSQL database REDDIT POSTS: User, Time/Date, Number of comments, type of post, text, score REDDIT COMMENTS: Reddit u/ of the comment/post, date/time, text/body, the comment score, the post that the comment was linked to, and all the associated relational data 4CHAN POSTS: Time/Date, comment, number of child comments (if applicable)
提供机构:
Topstonks
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集收集了自2019年以来来自r/wallstreetbets和4Chan的/Biz等热门股票和加密货币论坛的社交讨论、情感分析及原始数据,包含完整的帖子和评论文本、元数据以及基于深度学习的情感分析。数据量达10GB,存储于PostgreSQL数据库,涵盖用户、时间、股票代码等结构化信息,并支持API访问,适用于机构客户。
以上内容由遇见数据集搜集并总结生成



