PeytonT/repo_code_snippets
Source: Hugging Face. Updated 2026-04-25. Indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/PeytonT/repo_code_snippets
Resource description:
---
license: other
task_categories:
- text-generation
language:
- code
pretty_name: Repository Code Snippets
configs:
- config_name: default
  data_files:
  - split: train
    path: parquet/snippets.parquet
- config_name: repo_shards
  data_files:
  - split: train
    path: source_snippets/*.jsonl
---
# Repository Code Snippets
Focused source-code snippets for static repository graph browsing.
Rows are keyed by `repo_id` and `entity_id`, matching the public repo graph
entity ids. Each row contains a focused source span plus surrounding context,
not a complete repository dump.
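
Because rows are keyed by `repo_id` and `entity_id`, a lookup table over a JSONL shard is a natural way to resolve a graph entity to its snippet. The following is a minimal sketch using only the standard library. It assumes each line of a shard is one JSON object and that the `(repo_id, entity_id)` pair is unique; if it is not, the later row wins. The helper name `index_snippets` and the sample rows are hypothetical.

```python
import json
from typing import Dict, Iterable, Tuple


def index_snippets(lines: Iterable[str]) -> Dict[Tuple[str, str], dict]:
    """Build a lookup from (repo_id, entity_id) to the parsed row."""
    index: Dict[Tuple[str, str], dict] = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the shard
        row = json.loads(line)
        index[(row["repo_id"], row["entity_id"])] = row
    return index


# Hypothetical rows standing in for lines read from a shard file.
sample_lines = [
    json.dumps({"repo_id": "example/repo", "entity_id": "fn:1", "name": "add"}),
    json.dumps({"repo_id": "example/repo", "entity_id": "fn:2", "name": "sub"}),
]

snippet_index = index_snippets(sample_lines)
print(snippet_index[("example/repo", "fn:2")]["name"])
```

For a real shard, the same function can be fed an open file handle, since iterating a text file yields its lines.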
## Columns
- `repo_id`
- `entity_id`
- `uri`
- `kind`
- `name`
- `path`
- `language`
- `start_line`
- `end_line`
- `context_start_line`
- `context_end_line`
- `code`
- `source_hash`
- `truncated`
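
The line-number columns suggest how to recover the focused span from the surrounding context. The sketch below assumes that `code` holds the text from `context_start_line` through `context_end_line`, that all line numbers are 1-indexed and inclusive, and that the row is not truncated. None of this is stated on the card, so verify it against real rows before relying on it. The function name `focused_span` and the sample row are hypothetical.

```python
def focused_span(row: dict) -> str:
    """Return only the lines between start_line and end_line.

    Assumes row["code"] begins at context_start_line and that
    line numbers are 1-indexed and inclusive.
    """
    lines = row["code"].splitlines()
    offset = row["start_line"] - row["context_start_line"]
    length = row["end_line"] - row["start_line"] + 1
    return "\n".join(lines[offset:offset + length])


# Hypothetical row: context covers lines 10-15, the entity covers 12-13.
sample_row = {
    "context_start_line": 10,
    "context_end_line": 15,
    "start_line": 12,
    "end_line": 13,
    "truncated": False,
    "code": (
        "import math\n"
        "\n"
        "def add(a, b):\n"
        "    return a + b\n"
        "\n"
        "print(add(1, 2))\n"
    ),
}

print(focused_span(sample_row))
```

When `truncated` is true, `code` may be shorter than the context range implies, so the slice could come back incomplete.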
## Stats
- snippets: `468,268`
- skipped entities: `437,756`
- repos: `155`
- generated at: `1777152029`
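
The figures above can be put in more readable form with a little arithmetic. This sketch assumes that `generated at` is a Unix timestamp in seconds (UTC) and that snippets plus skipped entities together make up all entities considered. Both are interpretations, not statements from the card.

```python
from datetime import datetime, timezone

snippets = 468_268
skipped = 437_756
generated_at = 1_777_152_029  # assumed to be Unix seconds, UTC

# Share of considered entities that produced a snippet.
coverage = snippets / (snippets + skipped)

# Human-readable generation time.
generated = datetime.fromtimestamp(generated_at, tz=timezone.utc)

print(f"coverage: {coverage:.1%}")
print(f"generated: {generated.isoformat()}")
```

Under these assumptions the generation date falls on 2026-04-25, which agrees with the update date shown for the dataset.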
Provider:
PeytonT



