zarnite/zarn-meeting-to-actions

Name: zarnite/zarn-meeting-to-actions
Creator: zarnite
Published: 2026-04-18 00:06:46
License: No description provided

Hugging Face · Updated 2026-04-18 · Indexed 2026-04-26

Download link:

https://hf-mirror.com/datasets/zarnite/zarn-meeting-to-actions

Resource overview:

---
language:
- en
license: apache-2.0
annotations_creators:
- expert-generated
- machine-generated
language_creators:
- expert-generated
- machine-generated
source_datasets:
- original
task_categories:
- summarization
- text-generation
tags:
- zarnite
- benchmark
- meeting-notes
- action-items
- productivity
- gold-track
- benchmark-starter
pretty_name: Zarn Meeting to Actions
size_categories:
- 10K<n<100K
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train.jsonl
  - split: validation
    path: data/validation.jsonl
  - split: test
    path: data/test.jsonl
---

# Zarn Meeting to Actions

## Dataset Description

Meeting transcripts and notes mapped to summaries, decisions, owners, deadlines, and follow-up drafts.

## Team Attribution

This dataset was created and reviewed by the Zarnite team through internal benchmark design, generation, and quality-control workflows. It should be presented as a Zarnite-authored benchmark starter pack, not as a purely human-collected field corpus.

## Ecosystem Need Tier

High Ecosystem Need

## Why This Category Is Attractive

Meeting data is useful across almost every business workflow, but strong public benchmarks still underserve realistic ownership, blocker capture, and follow-up quality.

## Benchmark Goal

Evaluate whether a model can extract the real decisions, preserve unresolved tensions, and produce follow-up artifacts that sound consistent with the meeting.

## Included In This Folder

- `data/train.jsonl`, `data/validation.jsonl`, `data/test.jsonl`: starter benchmark splits with 1200 total rows.
- `schema.json`: JSON Schema for row validation.
- `benchmark_spec.json`: metrics, quality gates, and target release scale.
- `LICENSE.md`: folder-local license notice for self-contained publishing.
- `PUBLISHING.md`: repo-specific publish instructions for Hugging Face.
- `hf_repo_template.json`: machine-readable repo template used by the uploader script.

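The split files listed above are plain JSON Lines, so they can be inspected without any extra tooling. The following is a minimal sketch, not part of the published package: it assumes a local checkout of the folder, uses only the Python standard library, and the helper names `read_jsonl` and `load_splits` are hypothetical. The relative paths come from the `configs` block of the card.

```python
import json
from pathlib import Path

# Split files as listed in the card's `configs` block.
SPLIT_FILES = {
    "train": "data/train.jsonl",
    "validation": "data/validation.jsonl",
    "test": "data/test.jsonl",
}


def read_jsonl(path):
    """Parse one JSON object per non-empty line of a JSON Lines file."""
    rows = []
    with open(path, encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            try:
                rows.append(json.loads(line))
            except json.JSONDecodeError as exc:
                raise ValueError(f"{path}:{line_number}: invalid JSON ({exc})") from exc
    return rows


def load_splits(dataset_folder):
    """Return {split_name: list_of_rows} for every split file that exists."""
    folder = Path(dataset_folder)
    splits = {}
    for name, relative_path in SPLIT_FILES.items():
        path = folder / relative_path
        if path.exists():
            splits[name] = read_jsonl(path)
    return splits


if __name__ == "__main__":
    # Assumed local checkout path, copied from the example commands in this card.
    splits = load_splits("push/high-ecosystem-need/Zarn-Meeting-to-Actions")
    for name, rows in splits.items():
        print(f"{name}: {len(rows)} rows")
```

Summing the printed row counts is a quick way to check a checkout against the 1200 total rows stated for the starter splits.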
## Target Public Scale

- Train: 18,000
- Validation: 2,000
- Test: 2,000
- Total target rows: 22,000

## Recommended Metrics

- `action_item_f1`
- `summary_faithfulness`
- `owner_accuracy`
- `followup_consistency`
- `blocker_preservation`

## Gold-Track Benchmark Assets

- `ANNOTATION_GUIDELINES.md`: how to expand rows without drifting from the benchmark purpose.
- `REVIEW_PROTOCOL.md`: how to audit validation and test rows with dual review and adjudication.
- `BASELINE_EVAL_SPEC.json`: expected output contract, slice reporting, and release thresholds.
- `RELEASE_CHECKLIST.md`: final pre-publish checks for the public Hugging Face release.
- `SCORING_PROFILE.json`: prediction keys, scoring expectations, and slice reporting requirements.
- `prediction_template.jsonl`: starter template for benchmark submissions or baseline runs.

## Expanded Row Anatomy

- `pre_read_context`: the notes and tensions in play before the meeting starts.
- `transcript`: a realistic multi-party discussion with tradeoffs and blocker handling.
- `decision_tensions`: what the team had to balance before arriving at the final decision.
- `reference_followup_package`: external recap, internal note, and risk-aware follow-up artifacts.
- `difficulty_rationale`: why the row belongs in its difficulty bucket instead of a weaker slice.
- `benchmark_slices`: named reporting slices such as approval friction, proof preservation, or citation traps.
- `adversarial_features`, `expected_failure_modes`, and `review_readiness`: what the row is testing and how a gold-track reviewer should treat it.
- `evidence_manifest`, `reference_variants`, and `negative_examples`: the source evidence boundary, acceptable alternate answers, and concrete failure cases.

## Hugging Face Deployment

This folder is self-contained and can be uploaded as its own Hugging Face dataset repository.

- Suggested repo id: `zarnite/zarn-meeting-to-actions`
- Example upload command: `python upload_to_huggingface.py --dataset-folder "push/high-ecosystem-need/Zarn-Meeting-to-Actions" --repo-id "zarnite/zarn-meeting-to-actions"`
- You can swap the namespace by passing `--namespace YOUR_USERNAME` to the uploader.

## Local Evaluation

- Example eval command: `python run_priority_eval.py --dataset-folder "push/high-ecosystem-need/Zarn-Meeting-to-Actions" --splits validation test`
- `prediction_template.jsonl` gives the required output shape for local or leaderboard-style submissions.

## License

This package is marked `apache-2.0`. The rows in this folder are original starter examples for benchmark packaging.
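
For readers who want a rough local number before running the evaluation script, the `action_item_f1` metric named under Recommended Metrics can be approximated by hand. This card does not define the metric, so the sketch below makes an assumption: action items are compared as exact-match sets after lowercasing and whitespace normalization. The scoring actually specified in `SCORING_PROFILE.json` or used by `run_priority_eval.py` may differ, and the function names and sample strings are hypothetical.

```python
def normalize(text):
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(text.lower().split())


def action_item_f1(predicted, reference):
    """Set-based F1 between predicted and reference action-item strings.

    Assumption: exact match after normalization. This is an illustrative
    stand-in, not the benchmark's official scoring rule.
    """
    predicted_set = {normalize(item) for item in predicted}
    reference_set = {normalize(item) for item in reference}
    if not predicted_set and not reference_set:
        return 1.0  # nothing expected and nothing predicted
    overlap = len(predicted_set & reference_set)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted_set)
    recall = overlap / len(reference_set)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical action items, invented only for illustration.
    reference = ["Send the revised budget to finance", "Schedule vendor review"]
    predicted = ["send the revised  budget to finance", "Book a team offsite"]
    print(action_item_f1(predicted, reference))
```

Exact matching is deliberately strict: a paraphrased but correct action item scores as a miss, so a fuzzy or model-based matcher would be needed to approximate a more lenient rule.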

Provider:

zarnite
