five

miemiebabu/PhysCoT

收藏
Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/miemiebabu/PhysCoT
下载链接
链接失效反馈
官方服务:
资源简介:
# PhysCoT: Physics Chain-of-Thought Reasoning Dataset Large-scale dataset for training VLMs on physics reasoning with step-by-step solutions. ## Current Size - Train: 1,887 entries (v0.1 — ScienceQA + MMMU physics) - Val: 209 entries - Scaling to 100K with GPT-4o distillation ## Sources - ScienceQA physics (elementary level, with images) - MMMU Physics (university level, with images) - More sources coming: SciBench, OlympiadBench, GPT-4o generated ## Baseline Results (PhyX testmini) - Qwen2.5-VL-7B (zero-shot): **37.0%** - GPT-4o (reported): 32.5% - GPT-5 (reported): 65.2% ## Format JSONL with fields: - : path to image file - : [{from: human, value: question}, {from: gpt, value: CoT solution}] - : {source, difficulty, domain} ## License Apache 2.0
提供机构:
miemiebabu
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作