SciDraw-6K: A Multilingual Scientific Illustration Dataset Generated by Google Gemini
收藏DataONE2026-04-20 更新2026-05-19 收录
下载链接:
https://search.dataone.org/view/sha256:531b1cdf485bd0e2f22235690e07ac6af00951b7e896c6964ad6f922080e613a
下载链接
链接失效反馈官方服务:
资源简介:
SciDraw-6K is a curated dataset of 6,291 scientific illustrations synthesized by Google Gemini image-generation models (primarily gemini-3-pro-image-preview and gemini-2.5-flash-image), each paired with aligned prompts in eleven languages: English, Simplified Chinese, Traditional Chinese, Japanese, Korean, German, French, Spanish, Brazilian Portuguese, Italian, and Russian. Images span eight broad scientific categories — biomedical (44.9%), materials (13.4%), AI systems (11.2%), chemistry (9.7%), environment (9.2%), electronics (3.0%), physics (2.2%), and a residual \"other\" bucket (6.3%) covering long-tail disciplines such as robotics, mathematics, economics, civil engineering, and geosciences. Unlike general-purpose text-to-image corpora (LAION-5B, JourneyDB, DiffusionDB) which are dominated by photorealistic and artistic content, SciDraw-6K is purpose-built for the scientific-illustration genre: schematic diagrams, mechanism figures, table-of-contents graphical abstracts, and conceptual posters. The dataset is constructed via a domain-specific prompt taxonomy, Gemini image generation, LLM-based translation, and lightweight quality control. Each row of the metadata contains: a stable image ID, the public image URL, the file extension, the category label, the eleven multilingual prompts, the Gemini model identifier, the generation type, the creation timestamp, and the SHA-256 hash of the downloaded image bytes. Intended uses include: multilingual text-to-image research, domain-adapted diffusion fine-tuning, prompt-engineering studies for scientific visualization, retrieval-augmented generation for scientific figure synthesis, and benchmarking frontier image-generation models on the specialized visual grammar of science. This Dataverse record archives the dataset metadata and documentation. The full image payload (~19 GB) is hosted on Hugging Face (https://huggingface.co/datasets/SciDrawAI/SciDraw-6K) and archived on Zenodo (DOI: https://doi.org/10.5281/zenodo.19642870). Construction scripts are available at https://github.com/SciDrawAI/scidraw-6k.
创建时间:
2026-04-23



