Global AI Market Intelligence Index - SAMPLE
收藏Databricks2026-04-09 收录
下载链接:
https://marketplace.databricks.com/details/7053bd39-fa1a-4c44-acc5-05aa905abe8c/AIDC-Inc-_Global-AI-Market-Intelligence-Index---SAMPLE
下载链接
链接失效反馈官方服务:
资源简介:
**Description**
The Global AI Market Intelligence Index is a daily-updated, engineering-grade ledger of the global AI ecosystem. Built directly from the official HuggingFace Model Hub API, this dataset tracks 140,000+ AI models and over 1.7B cumulative downloads—transforming raw repository metadata into actionable market intelligence.
Unlike static scrapes, this index introduces proprietary intelligence signals including a Velocity Score (average daily downloads) to detect breakout trends before mainstream coverage, along with enterprise risk classification based on license and adoption thresholds. The result is a predictive market map that enables investors, data engineers, and ML strategists to identify emerging technologies, mitigate IP risk, and quantify real-world adoption dynamics across the AI economy.
**Provenance**
This dataset is sourced directly from the official HuggingFace Model Hub API (https://huggingface.co/api/models) via an asynchronous high-concurrency ingestion pipeline (Batch ID: HF-2026-FEB). Raw JSON streams are ingested without third-party aggregators.
**Data Lineage**
Direct API ingestion, timestamp normalization to UTC, license mapping to enterprise usage standards, statistical audit validation on the top 1% of assets for parity with live web counters
**Compliance**
PASSED (no critical nulls detected).
GDPR clear. No PII. Augmented dataset.
**License**
CC-4 for SAMPLE
**Acknowledgements**
Certified by AIDC Data Engineering
Kanchana1990, Kaggle Grandmaster
**Use Cases**
This dataset enables high-value analytics and ML workflows on Databricks:
- Predictive Trend Detection: Use Velocity Score signals to identify emerging AI architectures (e.g., quantized models) days before mainstream reporting.
- Enterprise Risk Screening: Filter models by commercial license compliance to reduce IP exposure in production ML pipelines.
- Market Share Intelligence: Analyze adoption shifts between Big Tech, Unicorn, and Indie publishers to guide Edge AI investment strategy.
- Alpha Generation for AI Investing: Build lead–lag and adoption momentum models across daily download dynamics.
- LLM & Task Landscape Mapping: Cluster models by task category to quantify growth in NLP, vision, multimodal, and edge deployments.
-
**Column Dictionary**
- Model_ID (string): Primary key. Unique identifier from HuggingFace source.
- Velocity_Score (integer): Proprietary average daily downloads metric signaling trend strength.
- Publisher_Tier (string): Classification (Big Tech / Unicorn / Indie).
- Enterprise_Ready (string): Risk flag. YES = license commercial & usage threshold met.
- Downloads (integer): Total cumulative adoption count.
- Task (string): Machine learning application category (e.g., NLP).
- Is_Commercial (boolean): True if license permits enterprise usage (Apache 2.0 / MIT).
- Technical Metadata
- Geographic Coverage: Global
- Data Source: Official HuggingFace Model Hub API (Engineering Metadata)
Keywords: AI market intelligence, HuggingFace models, ML adoption data, Velocity Score, enterprise AI compliance, open source AI trends, model download analytics, AI investment signals
提供机构:
AIDC, Inc.



