five

huzey/claude-skills

收藏
Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/huzey/claude-skills
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: name dtype: large_string - name: description dtype: large_string - name: full_content dtype: large_string - name: repo dtype: large_string - name: split dtype: large_string - name: qwen3emb_description list: float64 - name: gpt_domain dtype: large_string - name: gpt_capability dtype: large_string - name: gpt_use_case dtype: large_string - name: gpt_triggers_constraints dtype: large_string - name: domain_category dtype: large_string - name: domain_subcategory dtype: large_string - name: qwen3emb_full_content list: float32 - name: qwen3emb_gpt_domain list: float32 - name: qwen3emb_gpt_capability list: float32 - name: qwen3emb_gpt_use_case list: float32 - name: qwen3emb_gpt_triggers_constraints list: float32 - name: ref_files dtype: large_string - name: skills_sh_id dtype: string - name: skills_sh_total_installs dtype: int64 - name: skills_sh_weekly_installs dtype: int64 - name: github_stars dtype: int64 splits: - name: train num_examples: 22862 configs: - config_name: default data_files: - split: train path: data/train-* --- # Claude Skills Dataset This dataset contains curated SKILL.md files plus generated structured summaries and embeddings. ## Columns - `name`: skill name (from SKILL.md frontmatter) - `description`: short description (from SKILL.md frontmatter) - `full_content`: full SKILL.md content (includes frontmatter metadata) - `repo`: source repository - `split`: dataset split label - `qwen3emb_description`: embedding vector for `description` (float list) - `gpt_domain`: concise domain label extracted from the skill content - `gpt_capability`: one-sentence capability summary - `gpt_use_case`: one-sentence use-case summary - `gpt_triggers_constraints`: triggers/constraints if present, else empty string - `domain_category`: broad rule-based category from `gpt_domain` (tokenized, rule-based) - `domain_subcategory`: coarse subcategory within `domain_category` (rule-based; `Other` fallback) - `qwen3emb_full_content`: embedding vector for `full_content` after stripping YAML frontmatter (`--- ... ---`) - `qwen3emb_gpt_domain`: embedding vector for `gpt_domain` - `qwen3emb_gpt_capability`: embedding vector for `gpt_capability` - `qwen3emb_gpt_use_case`: embedding vector for `gpt_use_case` - `qwen3emb_gpt_triggers_constraints`: embedding vector for `gpt_triggers_constraints` - `ref_files`: inlined reference docs from the skill folder (path + `---` + content blocks), may be empty - `skills_sh_id`: canonical `skills.sh` id guess (`{repo}/{slugified(name)}`) - `skills_sh_total_installs`: installs count from skills.sh search index (leaderboard "Installs") - `skills_sh_weekly_installs`: weekly installs from the skills.sh skill page ("Weekly Installs") - `github_stars`: GitHub stars shown on the skills.sh skill page (repo-level; repeated across skills in same repo) ## Source - Crawled from skills.sh, 2026/04/02. Enriched with skills.sh installs + GitHub stars, 2026/04/06.
提供机构:
huzey
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作