five

oncody/AI_Agent_Task_Dataset

收藏
Hugging Face2026-04-03 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/oncody/AI_Agent_Task_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-generation - text-classification - token-classification - question-answering - table-question-answering - summarization - fill-mask - text-to-speech - automatic-speech-recognition - audio-classification - translation - zero-shot-classification - text-to-audio - depth-estimation - object-detection - feature-extraction language: - en tags: - ai-agents - synthetic - tool-use - procedural-generation - RahulChaube1 - EverestQ size_categories: - 10G<n<100G --- # 🤖 Massive AI Agent Task Dataset (10.5GB) <div align="center"> ![Dataset Size](https://img.shields.io/badge/Dataset_Size-10.5_GB-2ea44f?style=for-the-badge&logo=huggingface&logoColor=white) ![Format](https://img.shields.io/badge/Format-JSONL-blue?style=for-the-badge) ![License](https://img.shields.io/badge/License-MIT-blue.svg?style=for-the-badge) ![Curator](https://img.shields.io/badge/Curator-Rahul_Chaube-orange?style=for-the-badge) </div> --- ## 📌 Overview Welcome to the **AI Agent Task Dataset**, a massive **10.5GB procedural dataset** designed for training, fine-tuning, and evaluating **autonomous AI agents and LLMs**. This dataset focuses on: - Multi-step reasoning - Tool usage (APIs, frameworks, systems) - Real-world execution workflows Perfect for building **agentic AI systems, copilots, and automation models**. --- ## 📑 Table of Contents - Dataset Details - Dataset Structure - Tech Stack & Tool Coverage - How to Use (Quickstart) - Use Cases & Applications - Dataset Creation & Curation - License & Copyright --- ## 📊 Dataset Details - **Curator:** Rahul Chaube (oncody) - **Format:** `.jsonl` (JSON Lines) - **Size:** ~10.5 GB - **Language:** English - **Scale:** Tens of millions of structured task records --- ## 🏗️ Dataset Structure Each row represents a **complete reasoning workflow**. ### Fields - **goal** → High-level objective - **steps** → Step-by-step execution plan - **tools** → Required technologies/APIs - **output** → Expected result - **reasoning** → Why this approach works --- ### 📌 Example ```json { "goal": "Automate daily price monitoring for top competitors", "steps": [ "Fetch list of competitor URLs from Google Sheets API.", "Scrape HTML content of each URL.", "Parse price elements using CSS selectors.", "Compare scraped prices with the internal product database.", "Send a notification if a competitor's price drops below a defined threshold." ], "tools": [ "Python", "BeautifulSoup", "Google Sheets API", "Slack API" ], "output": "A daily Slack message listing price changes and an updated Google Sheet with current competitor prices.", "reasoning": "Google Sheets acts as an easily updatable database while automation ensures real-time monitoring and alerts." }
提供机构:
oncody
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作