zarnite/reolyy-scene-quality-fixes

Name: zarnite/reolyy-scene-quality-fixes
Creator: zarnite
Published: 2026-04-18 00:07:27
License: 暂无描述

Hugging Face2026-04-18 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/zarnite/reolyy-scene-quality-fixes

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en license: apache-2.0 annotations_creators: - expert-generated - machine-generated language_creators: - expert-generated - machine-generated source_datasets: - original task_categories: - text-classification - text-generation tags: - zarnite - benchmark - video-understanding - quality-control - creative-tools - gold-track - benchmark-starter pretty_name: Reolyy Scene Quality Fixes size_categories: - 10K<n<100K configs: - config_name: default data_files: - split: train path: data/train.jsonl - split: validation path: data/validation.jsonl - split: test path: data/test.jsonl --- # Reolyy Scene Quality Fixes ## Dataset Description Clip-level scene boundaries, quality problems, severity labels, and correction chains. ## Team Attribution This dataset was created and reviewed by the Zarnite team through internal benchmark design, generation, and quality-control workflows. It should be presented as a Zarnite-authored benchmark starter pack, not as a purely human-collected field corpus. ## Ecosystem Need Tier High Ecosystem Need ## Why This Category Is Attractive Scene-quality correction is a practical creator-tool task with measurable outputs, but public benchmarks often stay too shallow to reflect real finishing decisions. ## Benchmark Goal Evaluate issue identification, severity judgment, correction ordering, and the ability to preserve trust and naturalness while fixing visual defects. ## Included In This Folder - `data/train.jsonl`, `data/validation.jsonl`, `data/test.jsonl`: starter benchmark splits with 1200 total rows. - `schema.json`: JSON Schema for row validation. - `benchmark_spec.json`: metrics, quality gates, and target release scale. - `LICENSE.md`: folder-local license notice for self-contained publishing. - `PUBLISHING.md`: repo-specific publish instructions for Hugging Face. - `hf_repo_template.json`: machine-readable repo template used by the uploader script. ## Target Public Scale - Train: 50,000 - Validation: 5,000 - Test: 5,000 - Total target rows: 60,000 ## Recommended Metrics - `issue_label_accuracy` - `boundary_f1` - `severity_calibration` - `repair_plan_quality` - `preservation_quality` ## Gold-Track Benchmark Assets - `ANNOTATION_GUIDELINES.md`: how to expand rows without drifting from the benchmark purpose. - `REVIEW_PROTOCOL.md`: how to audit validation and test rows with dual review and adjudication. - `BASELINE_EVAL_SPEC.json`: expected output contract, slice reporting, and release thresholds. - `RELEASE_CHECKLIST.md`: final pre-publish checks for the public Hugging Face release. - `SCORING_PROFILE.json`: prediction keys, scoring expectations, and slice reporting requirements. - `prediction_template.jsonl`: starter template for benchmark submissions or baseline runs. ## Expanded Row Anatomy - `source_intent`: the look, goal, and viewer failure condition for the clip. - `camera_report`: practical operating notes that affect how conservative the correction should be. - `reference_fix_settings`: ordered finishing guidance rather than a flat list of possible fixes. - `failure_modes` and `acceptance_checks`: what can go wrong and how to know the clip is actually ready to ship. - `difficulty_rationale`: why the row belongs in its difficulty bucket instead of a weaker slice. - `benchmark_slices`: named reporting slices such as approval friction, proof preservation, or citation traps. - `adversarial_features`, `expected_failure_modes`, and `review_readiness`: what the row is testing and how a gold-track reviewer should treat it. - `evidence_manifest`, `reference_variants`, and `negative_examples`: the source evidence boundary, acceptable alternate answers, and concrete failure cases. ## Hugging Face Deployment This folder is self-contained and can be uploaded as its own Hugging Face dataset repository. - Suggested repo id: `zarnite/reolyy-scene-quality-fixes` - Example upload command: `python upload_to_huggingface.py --dataset-folder "push/high-ecosystem-need/Reolyy-Scene-Quality-Fixes" --repo-id "zarnite/reolyy-scene-quality-fixes"` - You can swap the namespace by passing `--namespace YOUR_USERNAME` to the uploader. ## Local Evaluation - Example eval command: `python run_priority_eval.py --dataset-folder "push/high-ecosystem-need/Reolyy-Scene-Quality-Fixes" --splits validation test` - `prediction_template.jsonl` gives the required output shape for local or leaderboard-style submissions. ## License This package is marked `apache-2.0`. The rows in this folder are original starter examples for benchmark packaging.

提供机构：

zarnite

5,000+

优质数据集

54 个

任务类型

进入经典数据集