thomasmustier/pi-for-excel-sessions
Source: Hugging Face | Updated: 2026-04-07 | Indexed: 2026-04-12
Download link:
https://hf-mirror.com/datasets/thomasmustier/pi-for-excel-sessions
Description:
---
pretty_name: coding agent session traces
task_categories:
- text-generation
tags:
- agent-traces
- coding-agent
- pi-share-hf
language:
- en
- code
license: other
---
# Coding agent session traces for thomasmustier/pi-for-excel-sessions
This dataset contains redacted coding agent session traces collected while working on [tmustier/pi-for-excel](https://github.com/tmustier/pi-for-excel). The traces were exported with [pi-share-hf](https://github.com/badlogic/pi-share-hf) from a local [pi](https://pi.dev) workspace and filtered to keep only sessions that passed deterministic redaction and LLM review.
## Data description
Each `*.jsonl` file is one redacted pi session, stored in JSON Lines format: every line is a single structured session entry. Entry kinds include session headers, user and assistant messages, tool results, model changes, thinking level changes, compaction summaries, branch summaries, and custom extension data.
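As a rough illustration of reading one of these files, the sketch below parses JSON Lines rows and tallies entries by kind. It assumes each entry exposes its kind in a `type` field; that field name and the sample rows are assumptions made for the example, so check the upstream session format documentation before relying on them.

```python
import json
from collections import Counter
from typing import Iterable, Iterator


def iter_entries(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed entry per non-blank JSON Lines row."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def count_entry_types(lines: Iterable[str]) -> Counter:
    """Tally entries by their `type` field (assumed name, not confirmed here)."""
    return Counter(entry.get("type", "unknown") for entry in iter_entries(lines))


# Hypothetical sample rows; real session entries carry more fields.
sample = [
    '{"type": "session", "id": "a"}',
    '{"type": "message", "id": "b", "parentId": "a"}',
    "",
    '{"type": "message", "id": "c", "parentId": "b"}',
]
counts = count_entry_types(sample)
```

To run this on a downloaded session, pass an open file handle instead of the sample list, for example `with open(path, encoding="utf-8") as f: count_entry_types(f)`, where `path` is whichever `*.jsonl` file you fetched.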
Pi session files are tree-structured via `id` and `parentId`, so a single session file may contain multiple branches of work. See the upstream session format documentation for the exact schema:
- https://github.com/badlogic/pi-mono/blob/main/packages/coding-agent/docs/session.md
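Because entries link to each other through `id` and `parentId`, individual branches can be recovered by walking parent links. The following is a minimal sketch, assuming root entries have a missing or null `parentId` and that entries without an `id` can be skipped; the helper names and sample data are hypothetical, not part of pi.

```python
from collections import defaultdict


def build_children(entries):
    """Map each parent id (None for roots) to the ids of its children."""
    children = defaultdict(list)
    for entry in entries:
        if "id" in entry:
            children[entry.get("parentId")].append(entry["id"])
    return children


def leaf_ids(entries):
    """Return ids of entries that no other entry names as its parent."""
    children = build_children(entries)
    return [e["id"] for e in entries if "id" in e and e["id"] not in children]


def branch_to(leaf_id, entries):
    """Return the ids from the root down to `leaf_id`, following parentId."""
    by_id = {e["id"]: e for e in entries if "id" in e}
    path, seen = [], set()
    current = leaf_id
    # `seen` guards against malformed files with cyclic parent links.
    while current in by_id and current not in seen:
        seen.add(current)
        path.append(current)
        current = by_id[current].get("parentId")
    return path[::-1]


# Hypothetical session with two branches forking at entry "a".
entries = [
    {"id": "a", "parentId": None},
    {"id": "b", "parentId": "a"},
    {"id": "c", "parentId": "b"},
    {"id": "d", "parentId": "a"},
]
branches = [branch_to(leaf, entries) for leaf in leaf_ids(entries)]
```

Each element of `branches` is one linear line of work, which is usually the unit you want when turning a tree-structured session into training or analysis examples.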
Source git repo: [tmustier/pi-for-excel](https://github.com/tmustier/pi-for-excel)
## Redaction and review
The data was processed with [pi-share-hf](https://github.com/badlogic/pi-share-hf) using deterministic secret redaction plus an LLM review step. Deterministic redaction targets exact known secrets and curated credential patterns. The LLM review decides whether a session is about the OSS project, whether it is fit to share publicly, and whether any sensitive content appears to have been missed.
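To make the idea of deterministic redaction concrete, here is an illustrative sketch that replaces exact known secrets first and then applies a short list of credential patterns. It is not pi-share-hf's implementation: the placeholder string, the function name, and the two patterns (a GitHub personal access token shape and an AWS access key ID shape) are assumptions chosen for the example.

```python
import re

# Illustrative patterns only; a real curated list would be longer.
PATTERNS = [
    re.compile(r"ghp_[A-Za-z0-9]{36}"),  # GitHub personal access token shape
    re.compile(r"AKIA[0-9A-Z]{16}"),     # AWS access key ID shape
]
PLACEHOLDER = "[REDACTED]"  # hypothetical marker


def redact(text, known_secrets=()):
    """Replace exact known secrets, then pattern matches, with a placeholder."""
    # Longest first, so a secret that contains another is replaced whole.
    for secret in sorted(known_secrets, key=len, reverse=True):
        if secret:
            text = text.replace(secret, PLACEHOLDER)
    for pattern in PATTERNS:
        text = pattern.sub(PLACEHOLDER, text)
    return text


line = "key=AKIA" + "A" * 16 + " pw=hunter2"
cleaned = redact(line, known_secrets=["hunter2"])
```

The same input always yields the same output, which is what makes this step deterministic; anything the fixed list does not anticipate is left for the LLM review to catch.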
Embedded images may be preserved in the uploaded sessions unless the workspace was initialized with `--no-images`.
## Limitations
This dataset is best-effort redacted. Coding agent transcripts can still contain sensitive or off-topic content, especially if a session mixed OSS work with unrelated private tasks. Use with appropriate caution.
Provider:
thomasmustier



