Gopher-Lab/Genius_Act_Twitter_Example
收藏Hugging Face2025-07-01 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/Gopher-Lab/Genius_Act_Twitter_Example
下载链接
链接失效反馈官方服务:
资源简介:
# 🐦 X-Twitter Scraper: Real-Time Search and Data Extraction Tool
Search and scrape X-Twitter (formerly Twitter) for posts by keyword, account, or trending topics. This no-code tool makes it easy to generate real-time, LLM-ready datasets for any AI or content use case.
Get started with real-time scraping and structure tweet data instantly into clean JSON.
---
## 🚀 Key Features
- ⚡ **Real-Time Fetch** – Stream the latest tweets the moment they’re posted
- 🎯 **Flexible Search** – Filter by keywords, hashtags, cashtags, accounts, or trending topics
- 📈 **Engagement Metrics** – Pull tweet content with likes, replies, reposts & timestamps
- 🧩 **LLM-Ready Output** – Output in clean JSON for agents, RAG, fine-tuning, or analytics
- 💸 **Free Tier** – Up to 100 searches during beta; toggle up to 25 tweets per query
---
## 🛠 How It Works
1. Open the Scraper Tool
2. Enter your query – keyword, `#hashtag`, `@user`, or `$cashtag`
3. Choose tweet count – up to 25 tweets per search
4. Run the search – engine fetches fresh tweets in real time
5. Export the results – copy or download JSON for use in any pipeline
---
## 🔥 Popular Use Cases
- Sentiment and price prediction for crypto or stocks
- Trend discovery & viral content tracking
- News & political monitoring in real time
- Creating LLM training datasets
- Feeding live data to AI agents
- ...and more!
---
## 📂 Dataset Format
Each tweet is structured in LLM-friendly JSON with fields like:
- `username`
- `tweet_id`
- `content`
- `likes`
- `reposts`
- `replies`
- `timestamp`
- `query_used`
---
## 📎 License
**MIT License** – Free to use for research, development, and commercial projects during beta.
---
## ✨ Try It Now
Start Searching & Scraping X-Twitter:
👉 **[Launch Scraper Tool](https://data.masa.ai/x/search)**
Need help? Join the **Masa Discord** in the `#developers` channel.
# 🐦 X-Twitter 爬虫工具:实时搜索与数据提取工具
可基于关键词、账号或热门话题,对X-Twitter(前身为Twitter)平台上的帖文进行搜索与爬取。这款无代码工具可轻松生成适用于各类AI或内容场景的实时、适配大语言模型(LLM)的数据集。
即刻开启实时爬取,可将推文数据快速整理为规范的JSON格式。
---
## 🚀 核心功能
- ⚡ **实时获取** – 实时推送刚发布的最新推文
- 🎯 **灵活搜索** – 可基于关键词、话题标签(hashtag)、现金标签(cashtag)、账号或热门话题进行筛选
- 📈 **互动指标** – 可拉取包含点赞、回复、转发及发布时间戳的推文内容
- 🧩 **适配大语言模型的输出格式** – 输出规范JSON格式,可直接用于AI智能体(AI Agent)、检索增强生成(RAG)、模型微调或数据分析
- 💸 **免费试用层级** – 测试阶段支持最多100次搜索,单次查询可获取最多25条推文
---
## 🛠 工作原理
1. 打开该爬虫工具
2. 输入查询内容 – 可使用关键词、`#hashtag`、`@user`或`$cashtag`
3. 选择推文数量 – 单次搜索最多获取25条推文
4. 启动搜索 – 工具将实时获取最新推文
5. 导出结果 – 复制或下载JSON格式结果,可接入任意数据管道
---
## 🔥 热门应用场景
- 加密货币或股票的情绪与价格预测
- 趋势发现与爆款内容追踪
- 实时新闻与政治动态监测
- 构建大语言模型(LLM)训练数据集
- 为AI智能体(AI Agent)提供实时数据输入
- 以及更多应用场景!
---
## 📂 数据集格式
每条推文均采用适配大语言模型(LLM)的JSON格式进行组织,包含以下字段:
- `username`:用户名
- `tweet_id`:推文ID
- `content`:推文内容
- `likes`:点赞数
- `reposts`:转发数
- `replies`:回复数
- `timestamp`:发布时间戳
- `query_used`:所用查询词
---
## 📎 授权协议
**MIT许可证** – 测试阶段可免费用于研究、开发及商业项目。
---
## ✨ 立即试用
开始搜索并爬取X-Twitter:
👉 **[启动爬虫工具](https://data.masa.ai/x/search)**
需要帮助?请加入Masa Discord的`#developers`频道获取支持。
提供机构:
Gopher-Lab



