
avewright/exp085-parallel-multipv-harvest

Source: Hugging Face · Updated 2026-03-31 · Indexed 2026-04-12
Download link:
https://hf-mirror.com/datasets/avewright/exp085-parallel-multipv-harvest
Resource description:
---
pretty_name: exp085 Parallel MultiPV Harvest
task_categories:
- text-generation
language:
- en
license: mit
size_categories:
- 100K<n<1M
---

# exp085 Parallel MultiPV Harvest

This dataset is a frozen export of the `exp085_parallel_multipv_harvest.py` data harvester.

## Snapshot

- Export date: `2026-03-31`
- Shards: `44`
- Records: `224191`
- Size: `551330268` bytes of JSONL shard data
- Final shard: `positions_000044.jsonl`
- Final shard records: `2724`

## Included Files

- `dataset/positions_*.jsonl`
- `manifest.json`
- `status.json`
- `exp085.log`
- `stdout.log`
- `seen_positions.sqlite`

## Notes

- The JSONL line count is treated as the canonical record count for this export.
- `status.json` counters and SQLite dedupe counts may differ slightly from the shard line count because they were written on different update boundaries during harvesting.
- This export was frozen after stopping the live harvester and checkpointing SQLite WAL state.

## Provenance

- Harvester script: `experiments/exp085_parallel_multipv_harvest.py`
- Stockfish-backed MultiPV labeling
- Deduplication via SQLite position index
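
Because the card treats the JSONL line count as the canonical record count, a local copy can be checked by counting lines in the shards. The following is a minimal sketch, not part of the original export: the helper name `count_records` is hypothetical, it assumes the shards have been downloaded into a local directory matching the `dataset/positions_*.jsonl` pattern, and it assumes blank lines should not count as records. It does not parse the records, since the card does not describe their fields.

```python
from pathlib import Path


def count_records(dataset_dir):
    """Count non-empty lines in every positions_*.jsonl shard.

    Returns (per_shard, total), where per_shard maps shard file name
    to its line count. Hypothetical helper; not shipped with the dataset.
    """
    per_shard = {}
    # Sort so shards are reported in numeric order (names are zero-padded).
    for shard in sorted(Path(dataset_dir).glob("positions_*.jsonl")):
        with shard.open("r", encoding="utf-8") as fh:
            # Assumption: one JSON record per non-blank line.
            per_shard[shard.name] = sum(1 for line in fh if line.strip())
    return per_shard, sum(per_shard.values())
```

Calling `count_records("dataset")` on a complete download would be expected to report `44` shards and a total of `224191`, with `2724` for `positions_000044.jsonl`, if the snapshot figures in the card hold for that copy.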
Provided by:
avewright