zarnite/zarn-meeting-to-actions
收藏Hugging Face2026-04-18 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/zarnite/zarn-meeting-to-actions
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: apache-2.0
annotations_creators:
- expert-generated
- machine-generated
language_creators:
- expert-generated
- machine-generated
source_datasets:
- original
task_categories:
- summarization
- text-generation
tags:
- zarnite
- benchmark
- meeting-notes
- action-items
- productivity
- gold-track
- benchmark-starter
pretty_name: Zarn Meeting to Actions
size_categories:
- 10K<n<100K
configs:
- config_name: default
data_files:
- split: train
path: data/train.jsonl
- split: validation
path: data/validation.jsonl
- split: test
path: data/test.jsonl
---
# Zarn Meeting to Actions
## Dataset Description
Meeting transcripts and notes mapped to summaries, decisions, owners, deadlines, and follow-up drafts.
## Team Attribution
This dataset was created and reviewed by the Zarnite team through internal benchmark design, generation, and quality-control workflows. It should be presented as a Zarnite-authored benchmark starter pack, not as a purely human-collected field corpus.
## Ecosystem Need Tier
High Ecosystem Need
## Why This Category Is Attractive
Meeting data is useful across almost every business workflow, but strong public benchmarks still underserve realistic ownership, blocker capture, and follow-up quality.
## Benchmark Goal
Evaluate whether a model can extract the real decisions, preserve unresolved tensions, and produce follow-up artifacts that sound consistent with the meeting.
## Included In This Folder
- `data/train.jsonl`, `data/validation.jsonl`, `data/test.jsonl`: starter benchmark splits with 1200 total rows.
- `schema.json`: JSON Schema for row validation.
- `benchmark_spec.json`: metrics, quality gates, and target release scale.
- `LICENSE.md`: folder-local license notice for self-contained publishing.
- `PUBLISHING.md`: repo-specific publish instructions for Hugging Face.
- `hf_repo_template.json`: machine-readable repo template used by the uploader script.
## Target Public Scale
- Train: 18,000
- Validation: 2,000
- Test: 2,000
- Total target rows: 22,000
## Recommended Metrics
- `action_item_f1`
- `summary_faithfulness`
- `owner_accuracy`
- `followup_consistency`
- `blocker_preservation`
## Gold-Track Benchmark Assets
- `ANNOTATION_GUIDELINES.md`: how to expand rows without drifting from the benchmark purpose.
- `REVIEW_PROTOCOL.md`: how to audit validation and test rows with dual review and adjudication.
- `BASELINE_EVAL_SPEC.json`: expected output contract, slice reporting, and release thresholds.
- `RELEASE_CHECKLIST.md`: final pre-publish checks for the public Hugging Face release.
- `SCORING_PROFILE.json`: prediction keys, scoring expectations, and slice reporting requirements.
- `prediction_template.jsonl`: starter template for benchmark submissions or baseline runs.
## Expanded Row Anatomy
- `pre_read_context`: the notes and tensions in play before the meeting starts.
- `transcript`: a realistic multi-party discussion with tradeoffs and blocker handling.
- `decision_tensions`: what the team had to balance before arriving at the final decision.
- `reference_followup_package`: external recap, internal note, and risk-aware follow-up artifacts.
- `difficulty_rationale`: why the row belongs in its difficulty bucket instead of a weaker slice.
- `benchmark_slices`: named reporting slices such as approval friction, proof preservation, or citation traps.
- `adversarial_features`, `expected_failure_modes`, and `review_readiness`: what the row is testing and how a gold-track reviewer should treat it.
- `evidence_manifest`, `reference_variants`, and `negative_examples`: the source evidence boundary, acceptable alternate answers, and concrete failure cases.
## Hugging Face Deployment
This folder is self-contained and can be uploaded as its own Hugging Face dataset repository.
- Suggested repo id: `zarnite/zarn-meeting-to-actions`
- Example upload command: `python upload_to_huggingface.py --dataset-folder "push/high-ecosystem-need/Zarn-Meeting-to-Actions" --repo-id "zarnite/zarn-meeting-to-actions"`
- You can swap the namespace by passing `--namespace YOUR_USERNAME` to the uploader.
## Local Evaluation
- Example eval command: `python run_priority_eval.py --dataset-folder "push/high-ecosystem-need/Zarn-Meeting-to-Actions" --splits validation test`
- `prediction_template.jsonl` gives the required output shape for local or leaderboard-style submissions.
## License
This package is marked `apache-2.0`. The rows in this folder are original starter examples for benchmark packaging.
提供机构:
zarnite



